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Introduction 

Caveolae  of  vascular  endothelium  provide  inherently  accessible  targets  as  well  as  a  pathway  for 
transporting  antibodies  across  the  endothelial  cell  barrier.  Monoclonal  antibodies  that  can  target  tissue-specific 
proteins  located  in  caveolae  permit  rapid  tissue-specific  delivery  in  vivo  and,  perhaps  more  importantly,  are 
transported  into  and  even  across  the  endothelium.  When  accessible  vascular  sites,  such  as  caveolae,  are 
targeted,  tissue-specific  uptake  of  targeting  antibodies  is  observed  in  just  30  minutes  with  single  tissue 
accumulation  of  80%  of  the  injected  drug  and  up  to  1 80-fold  more  than  controls.  Such  transport  pathways  may 
be  useful  in  improving  the  delivery  of  antibodies  conjugated  to  agents  such  as  chemotherapeutic  drugs,  toxins 
or  even  gene  vectors.  The  goal  of  the  proposed  research  was  to  test  the  hypothesis  that  re-directing  antibody 
probes  to  accessible  caveolar  targets  of  tumor  blood  vessels  (rather  than  to  the  breast  tumor  cells  themselves) 
increases  not  only  directed  delivery  to  solid  tumors  but  also  increased  penetration  and  accumulation  of  antibody 
probes  into  the  tumor  itself  for  greater  therapeutic  efficacy  with  fewer  side  effects.  The  focus  of  this  project  is  to 
discover  caveolar  proteins  that  are  induced  specifically  in  the  vascular  endothelium  of  breast  tumors  but  that  are 
not  expressed  in  normal  tissues.  We  intend  to  generate  a  caveolae-targeted  monoclonal  antibodies  that  may 
duplicate  the  efficient  transcytosis  across  the  lung  endothelium  observed  in  our  past  work  but,  in  this  case, 
exhibiting  breast  tumor  specificity.  The  targeting  caveolae  in  tumor  endothelium  may  prove  useful  in  vivo  to 
increase  the  efficacy  of  chemotherapeutic  agents  in  the  treatment  of  solid  breast  tumors. 

Progress  Summary 

Technical  Objective  1:  Proteomic  analysis  to  identify  and  characterize  tumor  caveolae-associated  proteins  that 
may  be  useful  for  tumor  vascular  targeting. 

Task  1 :  Completed.  Isolation  of  caveolae  from  breast  tumor  and  normal  tissues. 

We  have  successfully  optimized  the  experimental  parameters  for  isolating  luminal  endothelial  cell 
caveolae  from  various  normal  tissues  as  well  as  breast  tumor  tissue  from  both  rats  and  humans.  Using  these 
newly-established  protocols,  we  can  generate  sufficient  material  to  undertake  the  proposed  proteomic  analysis. 

Task  2:  Completed.  Comparative  protein  analysis  of  caveolae  isolates. 

We  have  successfully  performed  a  comparative  proteomic  analysis  of  endothelial  cell  caveolae  from 
breast  tumors  vs.  normal  tissues  using  mass  spectrometry  (MS)  of  proteins  extracted  from  the  caveolae  isolates. 
These  experiments  have  resulted  in  the  identification  of  numerous  proteins  within  these  isolates  that  appear  to 
be  induced  in  the  breast  tumor  caveolae.  A  total  of  7  MS  analyses  of  normal  and  breast  tumor  caveolar  proteins 
have  been  completed.  Part  of  these  results  have  recently  been  published  (Nature  429,  629,  2004).  They  are  also 
the  subject  of  a  second  paper  in  preparation. 

Task  3:  Completed.  Identification  of  candidate  tumor-specific  proteins  by  mass  spectrometry,  de  novo 
sequencing,  and  classic  chemical  microsequencing. 

We  have  identified  5  candidate  tumor-induced  proteins,  designated  TE1-5,  found  in  breast  tumors  but 
not  in  normal  tissues.  Greater  than  3  peptide  sequences  were  determined  for  each  protein  by  our  MS  analysis. 
Polyclonal  antibodies  raised  to  peptides  of  these  proteins  were  used  for  validation  to  confirm  their  tumor-  and 
caveolar-association.  We  find  significant  enrichment  of  TE1-5  in  luminal  endothelial  cell  plasma  membrane 
subfractions  (P)  isolated  from  tumor-bearing  tissues  relative  to  the  whole  tissue  lysates.  Western  analysis  from 
various  normal  organs  reveals  little  to  no  expression  of  TE3  and  TE4  in  normal  tissues,  some  expression  of  TE1 
and  TE2  on  normal  liver  endothelium,  and  some  expression  of  TE5  in  normal  kidney  and  liver  although  at 
levels  much  less  than  breast  tumor  endothelium.  When  we  immunoblotted  for  TE1-5  on  caveolar  membrane 
isolates  (V),  we  found  that  TE1,  TE3,  TE4,  and  TE5  do  indeed  appear  to  reside  in  caveolae.  By  Western 
analysis,  TE1  and  TE3  are  significantly  enriched  in  tumor  caveolae  while  TE2  is  found  enriched  in  tumor  P  but 
not  V  and  remains  in  luminal  endothelial  cell  plasma  membrane  fraction  stripped  of  caveolae  (P-V).  TE5  is 
present  in  V  but  not  enriched  there.  In  addition,  we  have  independently  identified  each  of  these  proteins  (TE1-5) 
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using  T-D  gel  maps  &  MS  analysis  of  membranes  isolated  from  various  solid  tumors  from  rat,  rabbit,  rhesus 
mqnkey  and  humans. 

Because  TE3  appears  to  be  highly  tumor-specific,  we  focused  our  efforts  on  further  validation  of  its  tumor- 
specificity  by  using  tissue  immunostaining  of  panel  of  normal  and  neoplastic  tissues.  TE3  is  expressed 
selectively  on  the  breast  tumor  vascular  endothelium  and  not  in  nearby  normal  tissue.  That  is,  TE3  is  detected 
on  the  blood  vessels  within  the  growing  tumor  whereas  there  is  no  labeling  of  normal  vasculature.  Other  tissues 
(heart,  brain,  kidney,  skeletal  muscle,  fat,  liver)  were  negative,  consistent  with  the  Western  analysis  described 
above.  We  expanded  our  validation  of  TE3  to  include  human  tissues.  Western  analysis  of  human  tumor  and 
normal  tissues  revealed  that  TE3  is  expressed  in  caveolae  not  only  in  breast  tumors  but  also  in  tumors  from 
kidney,  liver,  lung,  brain,  and  prostate  but  not  in  various  normal  tissues.  Thus,  the  tumor-induction  of  TE3  can 
be  demonstrated  with  antibodies  both  in  rat  and  human  solid  tumors  of  different  types.  Human  tissue 
immunostaining  using  TE3  antibodies  confirmed  its  tumor-specific  expression.  TE3  is  expressed  in  the  blood 
vessels  of  human  breast,  prostate,  and  liver  tumor  tissue  whereas  no  labeling  was  detected  in  adjacent  normal 
tissue  vasculature  (negative  staining  in  normal  brain,  lung,  and  liver  tissue).  Thus,  TE3  is  induced  quite 
selectively  in  caveolae  of  blood  vessels  of  various  human  solid  tumors,  including  breast,  liver,  and  prostate. 

In  the  last  year,  we  have  focused  our  efforts  on  identifying  and  cloning  the  protein  corresponding  to 
TE3.  We  have  now  identified  TE3  as  a  34  kDa  form  of  Annexin  A1  (AnnAl)  from  multiple  samples  by  MS 
analysis:  i)  as  a  band  found  in  tumor  but  not  normal  caveolae  by  1-D  gel  analysis;  ii)  as  a  unique  spot  on  a  2-D 
gel  from  caveolae  isolated  from  tumor  but  not  normal  tissue;  and  iii)  in  TE3  antibody  immunoprecipates  from 
silica-coated  luminal  endothelial  cell  plasma  membranes  (P)  isolated  from  tumors.  In  addition  to  tandem  MS, 
we  have  also  detected  AnnAl  from  the  TE3  fingerprint  in  MALDI  analysis  of  tumor  P.  In  all,  we  have 
identified  have  identified  30  peptides  providing  ~58%  sequence  coverage  for  AnnAl. 

Technical  Objective  2:  Generate  and  characterize  monoclonal  antibodies  specific  for  caveolae-specific 
molecules  of  rat  breast  tumor  models. 

Task  4:  Completed.  Production  of  monoclonal  antibodies  in  mice  using  isolated  caveolar  membrane  fractions 
as  an  immunogen  or  specific  tumor-induced  proteins  identified  by  protein  analysis  (Task  3). 

To  better  characterize  TE3/AnnAl  expression  and  to  test  possible  targeting  of  solid  tumors  in  vivo,  we 
needed  high  affinity,  specific  probes.  Last  year,  we  reported  that  we  had  cloned  the  TE3  cDNAs  and  begun  to 
make  monoclonal  antibodies  to  rat,  mouse  and  human  TE3/AnnAl.  Using  these  cDNAs,  we  produced 
recombinant  protein  from  each  of  the  three  species  for  antibody  generation  and  screening  of  mouse  hybridomas. 
We  have  now  successfully  generated  a  panel  of  monoclonal  antibodies  (mAbs)  recognizing  rat,  mouse,  and 
human  TE3/AnnAl,  some  of  which  we  expect  to  use  in  an  upcoming  clinical  trial. 

Task  5:  Completed.  Establish  monospecificity  of  candidate  antibodies  by  Western  analysis  of  multiple  rat  and 
human  tumor  and  normal  tissues,  including  blood. 

We  have  established  the  monospecificity  of  our  new  TE3/AnnAl  mAbs  by  Western  analysis  of  luminal 
endothelial  cell  surface  membranes  and  caveolae  isolated  from  various  rat  and  human  tumors. 

Task  6:  Completed.  Establish  tumor  specificity  of  select  candidate  mAbs  by  analyzing  multiple  rat  and  human 
tumor  and  normal  tissues  by  both  Western  analysis  of  isolated  caveolae  and  by  tissue  immunostaining. 

We  have  established  the  tumor-specificity  of  our  new  TE3/AnnAl  mAbs  by  both  Western  analysis  of 
endothelial  caveolar  isolates  from  normal  and  neoplastic  tissues  and  by  tissue  immunostaining  of  rat  and  human 
normal  and  tumor  tissue  sections.  We  have  also  determined  that  these  new  mAbs  also  specifically  label  the 
blood  vessels  of  multiple  rat,  mouse,  and  human  tumor  types  (including  breast)  but  not  in  normal  tissues  similar 
to  our  results  with  the  polyclonal  antibodies  described  above  in  Task  3.  Work  using  these  new  antibodies  has 
recently  been  published  (Nature  429,  629,  2004). 

Technical  Objective  3:  Test  the  ability  of  developed  antibodies  to  target  specifically  rat  breast  tumors  in  vivo 
by  assessing  tissue  biodistribution  in  vivo  after  intravenous  injection. 
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Tqsk  7:  Completed.  Radiolabel  candidate  antibodies  with  I12'  and  assess  tumor  targeting  via  IV  injection  and 
gamma  counting  of  excised  tissues. 

To  test  whether  TE3/AnnAl  is  sufficiently  exposed  and  breast  tumor  vessel-specific  to  permit 
immunotargeting  in  vivo,  we  performed  bio-distribution  analysis  on  our  metastatic  breast  tumor  model,  whereby 
13762  breast  tumor  cells  are  injected  into  the  rat  tail  vein  and  allowed  to  seed  the  lungs  with  metastatic  breast 
tumors,  and  the  subcutaneous  tumor  model,  whereby  MDAMB435  cells  are  injected  subcutaneously  into  the 
anterior  back  region,  to  create  easily  observed  and  palpable  tumors  that  can  be  excised  for  analysis. 

We  injected  l2:,I-labeled  TE3/AnnAl  antibodies  intravenously  to  assess  biodistribution  in  vivo  and 
define  tissue  and  tumor  accumulation.  Significant  accumulation  of  TE3/AnnAl  antibody  was  observed  in  the 
whole  lungs  with  breast  tumors  (metastatic  model)  reaching  a  maximum  by  two  hours  (17%  ID/g).  In  breast 
tumors  dissected  away  from  normal  lung  tissue,  TE3/AnnAl  mAb  accumulation  reached  a  maximum  of  33% 
ID/g  of  tumor  at  2  hours.  When  injected  IV  into  normal  rats  without  breast  tumors,  AnnAl  antibodies  showed 
no  targeting  of  normal  organs,  at  levels  <1%  ID/g  of  tissue  (data  not  shown).  These  data  indicate  that  AnnAl  in 
tumor  endothelial  cell  caveolae  is  exposed  on  the  outside  of  the  cell  and  is  readily  accessible  by  IV  injection  for 
breast  tumor-selective  delivery  in  vivo. 

In  addition,  we  have  expanded  the  biodistribution  studies  proposed  in  the  application  (and  described 
above)  to  include  the  use  of  higher  resolution,  state-of-the-art  imaging  technologies.  We  have  used  an  A- 
SPECT  high  resolution  small  animal  gamma  scintigraphy  imager  (Gamma  Medica,  Northridge,  CA)  to  rapidly 
visualize  tumor  targeting  in  vivo  of  radiolabeled  TE3/AnnAl  antibodies  injected  intravenously.  This 
technology,  which  was  not  available  to  us  at  the  time  of  the  original  application,  is  far  superior  to  the  old 
methodology  originally  proposed  because  it  provides  a  continuous  viewing  of  tissue  uptake,  eliminates  the  need 
for  single  animal  per  data  point  reducing  the  number  of  test  animals  needed  and  permitting  longitudinal  studies, 
and  greatly  increases  the  quality  and  quantity  of  data  collected.  Radio-iodinated  TE3/AnnAl  mAbs  injected  into 
the  tail  vein  of  rats  with  metastatic  breast  tumors  in  the  lung  accumulated  in  the  tumors  with  a  maximal 
saturation  in  the  tumors  of  ~35%  of  the  injected  dose/gram  of  tumor  tissue,  y-scintigraphic  planar  images 
captured  1  -4  hours  postiniection  showed  distinct  foci  of  radioactivity  in  the  lungs  and  little  signal  elsewhere  in 
the  body.  Non-targeting  1  “^I-labeled  IgGs  did  not  target.  When  the  lungs  were  imaged  ex  vivo,  we  observed  l25I- 
TE3/AnnAl  antibody  accumulation  in  the  tumor  as  a  hot  spot  corresponding  to  visible  tumors.  Targeting  was 
prevented  by  30-fold  excess  unlabeled  TE3/AnnAl  IgG  but  not  control  IgG.  We  also  injected  l25I-AnnAl 
antibodies  into  the  tail  vein  of  mice  bearing  the  MDAMB435  subcutaneous  breast  tumor  model  and  imaged  the 
mice  3  hours  later.  Gamma  scintigraphic  imaging  showed  specific  accumulation  of  l25I-AnnAl  antibodies  in  the 
tumor  but  not  anywhere  else.  Thus,  AnnAl  antibodies  can  target  multiple  tumor  types  in  rat  and  mouse  models 
and  is  an  excellent  candidate  for  further  pre-clinical  testing.  These  data  have  recently  been  published  in  Nature. 

Key  research  accomplishments 

•  Isolation  of  caveolae  from  breast  tumor  and  normal  tissues. 

•  Comparative  protein  analysis  of  caveolae  isolates  by  MS  analysis  is  underway  and  producing  useful 
data. 

•  Identification  of  candidate  tumor-specific  proteins,  TE1-5,  by  mass  spectrometry,  de  novo  sequencing, 
and  classic  chemical  microsequencing. 

•  Validation  of  tumor-  and  caveolar-specificity  of  TE1-5  with  polyclonal  antibodies  to  peptides  from  TE1- 
5  by  Western  analysis  and  tissue  immunostaining  of  rat  and  human-derived  normal  and  neoplastic 
tissues. 

•  Identified  the  most  promising  candidate  tumor-specific  protein,  TE3,  as  Annexin  A1  (AnnAl)  and 
cloned  the  cDNA  for  recombinant  protein-  and  monoclonal  antibody  production. 

•  Production  of  new  monoclonal  antibodies  recognizing  mouse,  human,  and  rat  TE3/AnnAl . 

•  Established  monospecificity  of  new  TE3/AnnAl  monoclonal  antibodies  by  Western  analysis  of  multiple 
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'rat  and  human  tumor  and  normal  tissues,  including  blood. 

,  •  Established  tumor  specificity  of  multiple  new  TE3/AnnAl  monoclonal  antibodies  recognizing  mouse, 
rat,  and  human  TE3/AnnAl  by  analyzing  multiple  rat  and  human  tumor  and  normal  tissues  by  both 
Western  analysis  of  isolated  caveolae  and  by  tissue  immunostaining. 

•  Established  that  TE3/AnnAl  is  both  expressed  and  externally  exposed  selectively  in  endothelium 
growing  in  solid  tumors. 

•  Established  tumor  specificity  and  targeting  of  TE3/AnnAl  monoclonal  antibodies  by  in  vitro 
biodistribution  and  state-of-the-art  imaging  techniques  in  metastatic  and  subcutaneous  breast  tumor 
models  in  rat. 

Reportable  outcomes 
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,  Solid  Tumors”  (Alameda,  California) 
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2003  National  Cancer  Institute  Seminar,  “Proteomic  Mapping  of  Endothelium  and  Its  Caveolae  for  Targeting 
Single  Organs  and  Solid  Tumors  In  Vivo”  (Bethesda,  MD) 

2003  Stanford  University,  Division  of  Pulmonary  Medicine:  “Caveolae  Trafficking  and  Vascular  Proteomic 
Mapping  for  Organ-  and  Tumor-Specific  Targeting  in  Vivo”  (Stanford,  CA) 

2003  National  Cancer  Institute,  The  Fourth  Principal  Investigator’s  (PI)  Meeting  of  the  Innovative  Molecular 
Analysis  Technologies  (IMAT)  Program,  Exploring  Human  Cancer:  “Vascular  Proteomics:  Mapping 
targets  on  endothelia  and  caveolae  in  vivo  for  organ-  and  tumor-directed  delivery  and  transcytosis”  (San 
Diego,CA) 

2003  National  Heart,  Lung  and  Blood  Institute’s  Symposium,  From  Genome  to  Disease:  A  Symposium  of 
High  Throughput  Biology.  Microarray  Technology.  “The  vascular  proteome:  Mapping  endothelium  and 
its  caveolae  for  tissue-specific  pharmacodelivery  and  imaging  in  vivo”  (Bethesda,  MD) 

2003  Second  Annual  Meeting  of  The  Society  for  Molecular  Imaging.  “The  Vascular  Proteome:  Tissue- 
specific  Imaging  of  the  Vasculature  from  the  Whole  Body  to  Electron  Microscopy”  (San  Francisco,  CA) 
2003  American  Heart  Association,  57th  Annual  Fall  Conference  and  Scientific  Sessions  of  the  Council  for 
High  Blood  Pressure  Research  in  association  with  the  Council  on  Kidney  in  Cardiovascular  Disease, 
Special  Keynote  Lecture,  "Caveolae  Integrate  Signaling  and  Trafficking  as  Sensore  of  Hemodynamic 
Flow  and  Vasoactivators"  (Washington,  DC) 

2003  California  Breast  Cancer  Research  Program  2003  Symposium,  From  Research  to  Action:  A  Decade  of 
Progress  in  San  Diego,  "Targeting  Breast  Cancer  Blood  Vessels"  (San  Diego,  CA) 

2003  University  of  Illinois  at  Chicago,  Department  of  Pharmacology,  College  of  Medicine,  "Molecular 
Architecture  and  Function  of  Caveolae  in  Endothelium"  (Chicago,  IL) 

2003  Northwestern  University  Division  of  Nephrology/Hypertension,  "Function  of  Caveolae  in  Vascular 
Endothelium:  Proteomic  Mapping  and  Imaging  for  Tissue  Specific  Delivery"  (Chicago,  IL) 

2003  Molecular  Imaging  Program  at  Stanford  (MIPS)  Seminar  Series,  "Unmasking  IV-Accessible  Vascular 
Targets  Specific  to  Solid  Tumors  and  Single  Organs  by  Tissue  Subfractionation,  Subtractive  Proteomics, 
and  Molecular  Imaging"  (Stanford,  CA) 

2003  12th  International  Conference  on  Gene  Therapy  of  Cancer,  "Mapping  Accessible  Tumor  Vascular 
Targets  through  Subtractive  Proteomics  and  Molecular  Imaging  in  Vivo"  (San  Diego,  CA) 

2003  Immunogen  Corporation  Lecture,  "Unmasking  Endothelial  Cell  Surface  Proteins  for  Targeting  Solid 
Tumors  in  Vivo"  (Boston,  MA) 

2003  American  Society  of  Cell  Biology,  43rd  Annual  Meeting,  "Distinct  Mechanisms  of  Endocytosis  for 
Antibodies  Targeting  Mature  Caveolae  vs.  Sequestering  GPI-AP  in  Lipid  Rafts  to  Form  New  Caveolae" 
(San  Francisco,  CA) 

2004  American  Association  of  Cancer  Research  Advances  in  Cancer  Research,  "Unmasking  IV-Accessible 
Tumor  Vascular  Targets  by  Tissue  Subfractionation,  Subtractive  Proteomics  and  Molecular  Imaging" 
(Waikoloa,  Hawaii) 

2004  First  International  Conference  on  Genomics,  Signaling  and  Tumor  Targeting,  "Unmasking  specific  IV- 
accessible  immunotargets  in  lung  and  solid  tumors  via  prefractionated  subtractive  proteomics  and 
molecular  imaging  in  vivo."  (San  Diego,  CA) 

2004  Keystone  Symposia,  G-Protein-Coupled  Receptors:  Evolving  Concepts  and  Drug  Discovery, 
"Differential  Integration  of  Signaling  and  Trafficking  in  Caveolae  and  Lipid  Rafts  by  G  Proteins  and 
CPCRs"  (Taos,  New  Mexico). 

2004  Mayo  Clinic,  Seminar,  “Proteomic  Mapping  of  Endothelium  and  its  Caveolae  for  Rapid  Organ-  and 
Tumor-specific  Imaging,  Transcytosis,  and  Therapy  in  Vivo”  (Rochester,  MN) 
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2004  Second  International  Conference  on  Vascular  Targeting,  Ligand  Based  Vascular  Targeting.  “Caveolar 
Proteins”  (Miami,  FL) 

2004  Third  Annual  Congress  on  the  Future  of  Breast  Cancer,  “Caveolae:  Mining  Little  Caves  for  New  Cancer 
Treatment”  (Southampton,  Bermuda) 

2004  Gordon  Research  Conference,  Endothelial  Cell  Phonotypes  in  Health  and  Disease.  “Direct  Proteomic 
Mapping  and  Functional  Imaging  of  the  Endothelial  Cell  Surface  and  its  Caveolae  in  Vivo”  (Andover, 
NH) 

2004  Third  Annual  Meeting  of  the  Society  for  Molecular  Imaging,  Enhancing  Delivery  and  Traversing 
Barriers  (co-chair),  “Mapping  and  Targeting  Caveolae  to  Overcome  Endothelial  Cell  Barriers  for 
Tissue-  and  Tumor-Specific  Delivery  in  Vivo”  (St.  Louis,  MO) 

2004  16th  EORTC  -  NCI  -  AACR  Symposium  on  Molecular  Targets  and  Cancer  Therapeutics,  “Subtractive 
Proteomic  Mapping  of  the  Endothelial  Surface”  (Geneva  Switzerland) 

2004  AACR  Advances  in  Proteomics  in  Cancer  Research.  “Unmasking  Specific  IV-accessible  Targets  in 
Endothelial  Caveolae  of  Solid  Tumors  via  Prefractionated  Subtractive  Proteomics  and  Molecular 
Imaging”  (Key  Biscayne,  FL) 

2004  American  Heart  Association  Annual  Meeting  2004,  “Cholesterol-regulated  Signaling  in  Lipid 
Rafts/Caveolae”  (New  Orleans,  LA) 

2004  Vascular  Targeting  Agents  Therapeutic  and  Diagnostic  Development  Conference,  (Keynote  Speaker) 
“Proteomic  Mapping  and  Immunotargeting  of  Endothelium  and  its  Caveolae  for  Organ-  and  Tumor- 
Specific  Imaging  and  Therapy  Tissue  ”  (Cambridge,  MA) 

2004  Cambridge  Health  Institute  Eighth  Annual  Functional  Genomics  Meeting,  “Rapid  Discovery  and 
Validation  of  Accessible  Tissue-Specific  Targets”  (Boston,  MA) 

2004  Cambridge  Health  Institute  In  Vivo  Molecular  Imaging  Meeting,  “Tumor  Target  Discovery  and 
Validation  through  Integration  of  Subtractive  Proteomics  and  Molecular  Imaging  InVivo”  (La  Jolla, 
CA) 

Conclusions 

This  has  been  a  year  of  very  significant  progress.  The  work  described  above  to  validate  the  caveolar- 
and  tumor-localization  of  TE1-5  has  revealed  a  strong  tumor-  and  caveolae-specificity  of  TE3  (identified  as 
Annexin  Al)  in  both  rat  and  human  normal  and  neoplastic  tissues.  We  have  demonstrated  specific  tumor 
accumulation  of  AnnAl  antibodies  in  rats  shows  that  proteins  expressed  in  caveolae  can  indeed  be  accessible  by 
IV  injection  for  tumor-specific  delivery  in  vivo.  These  results  make  TE3/AnnAl  our  most  promising  candidate 
for  targeting  breast  tumor  caveolae  to  treat  human  breast  cancer.  This  proteomic  analysis  is  providing  promising 
candidates  for  breast  tumor-induced  proteins  on  endothelial  cell  surface  caveolae  that  may  be  useful  for 
specifically  targeting  drugs  or  genes  to  breast  tumors  while  sparing  bystander  tissues  and  decreasing  side 
effects.  Thus,  these  studies  appear  promising  and  may  provide  important  site-specific  targets  for  the  "magic 
bullets"  envisioned  many  decades  ago  as  a  means  to  provide  improved,  more  efficacious,  localized  anti-cancer 
therapy  delivered  to  the  tumor  cells.  These  studies  have  recently  been  published  in  the  journal  Nature.  The 
proteomic  map  of  the  normal  lung  endothelial  cell  surface,  which  was  a  critical  control  for  the  subtraction 
analysis  to  identify  tumor-induced  candidate  targets,  has  been  published  recently  in  the  journal  Nature 
Biotechnology. 
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Direct  proteomic  mapping  of  the  lung  microvascular 
endothelial  cell  surface  in  vivo  and  in  cell  culture 

Eberhard  Durr  Jingyi  Yu  \  Karolina  M  Krasinska  \  Lucy  A  Carver  *,  John  R  Yates  III 2,  Jacqueline  E  Testa 
Phil  Oh  1  &  Jan  E  Schnitzer  1 


Endothelial  cells  can  function  differently  in  vitro  a nd  in  vivo;  however,  the  degree  of  microenvironmental  modulation  in  vivo 
remains  unknown  at  the  molecular  level  largely  because  of  analytical  limitations.  We  use  multidimensional  protein  identification 
technology  (MudPIT)  to  identify  450  proteins  (with  three  or  more  spectra)  in  luminal  endothelial  cell  plasma  membranes  isolated 
from  rat  lungs  and  from  cultured  rat  lung  microvascular  endothelial  cells.  Forty-one  percent  of  proteins  expressed  in  vivo  are  not 
detected  in  vitro.  Statistical  analysis  measuring  reproducibility  reveals  that  seven  to  ten  MudPIT  measurements  are  necessary  to 
achieve  >95%  confidence  of  analytical  completeness  with  current  ion  trap  equipment.  Large-scale  mapping  of  the  proteome  of 
vascular  endothelial  cell  surface  in  vivo ,  as  demonstrated  here,  is  advisable  because  distinct  protein  expression  is  apparently 
regulated  by  the  tissue  microenvironment  that  cannot  yet  be  duplicated  in  standard  cell  culture. 


A  monolayer  of  highly  attenuated  endothelial  cells  (ECs)  lines  all  blood 
vessels  to  form  a  physiologically  vital  interface  between  the  circulating 
blood  and  the  underlying  cells  inside  the  tissue.  This  blood-tissue  inter¬ 
face  plays  an  important  role  in  controlling  the  passage  of  blood  mole¬ 
cules  and  cells  into  the  tissue  and  in  many  physiological  functions, 
including  vasoregulation,  coagulation  and  inflammation,  as  well  as  tis¬ 
sue  nutrition,  growth,  survival,  repair  and  overall  organ  homeostasis 
and  function1.  Disruption  of  the  vascular  endothelium  and  its  normal 
barrier  function  can  lead  rapidly  to  tissue  edema,  hypoxia,  pathology 
and  even  organ  death2. 

Although  the  microenvironment  of  the  tissue  surrounding  the  blood 
vessels  clearly  influences  EC  phenotype3-8,  little  molecular  information 
is  available  regarding  vascular  endothelium  as  it  exists  in  native  tissue. 
This  is  in  large  part  because  of  technical  limitations  in  molecular  profil¬ 
ing  of  a  cell  type  that  represents  such  a  small  percentage  of  the  cells  in  the 
tissue.  Past  approaches  have  analyzed  endothelial  cells  isolated  from  tis¬ 
sue  by  enzymatic  digestion  and  sorting  of  the  released  single  cells  using 
EC  markers9-12.  Although  the  study  of  isolated  and  even  cultured  ECs  in 
vitro  has  yielded  much  functional  and  molecular  information,  both 
enzymatic  and  mechanical  tissue  disassembly  and  growth  in  culture 
contribute  to  phenotypic  changes  that  result  in  morphological  alter¬ 
ations  as  well  as  loss  of  native  function  and  protein  expression'. 
Although  expected  to  be  substantial,  the  molecular  differences  between 
ECs  in  vivo  and  in  vitro  are  unknown.  The  reported  ability  of  cells  and 
phage-displayed  peptides  to  home  to  specific  tissues  of  the  body  after 
intravenous  injection  also  provides  indirect  evidence  supporting  the 
molecular  heterogeneity  of  the  endothelial  cell  surface  among 
organs13,14  but  has  not  yet  facilitated  global  mapping  of  EC  surface  pro¬ 
teins  in  vivo. 


Here,  we  report  our  initial  efforts  to  provide  a  large-scale  mapping  of 
the  cell-surface  proteome  of  microvascular  endothelium  as  it  exists  in 
rat  lung  tissue  in  vivo  and  in  cultured  rat  lung  ECs. 

RESULTS 

Sample  preparation  and  quality  control 

The  silica-coating  methodology1 5-19  was  used  to  isolate  luminal  EC 
plasma  membranes  from  rat  lungs  and  from  cultured  rat  lung  micro¬ 
vascular  ECs  (RLMVEC;  isolated  and  grown  in  culture  as  in  our 
past  work20)  for  comparative  proteomic  analysis  by  two-dimensional 
liquid  chromatography  tandem  mass  spectrometry  (2D  LC-MS/MS). 
Rigorous  quality  control  testing  established  >95%  purity  for  silica- 
coated  luminal  EC  plasma  membranes  (P)  in  EC  plasma  membranes 
based  on  a  >20-fold  enrichment  relative  to  total  homogenate/lysate  (H) 
in  more  than  two  EC  surface  marker  proteins  and  >20-fold  depletion 
in  well-established  markers  of  subcellular  organelles  (Fig.  la  and 
Methods).  For  rat  lung  P,  we  also  required  >20-fold  depletion  in  mem¬ 
brane  protein  markers  from  other  cell  types  present  in  lung  or  blood 
(Fig.  lb  and  Methods). 

Analytical  completeness  of  large  scale  MS/MS  analysis 

To  analyze  rat  lung  EC  surface  proteins  comprehensively,  we  did  >30 
measurements  using  MudPIT21  on  P  isolated  from  rat  lungs  and 
RLMVEC.  These  data  were  entered  into  a  relational  database  that 
allowed  us  to  combine  results  of  individual  measurements  for  analysis 
separately  or  in  totality. 

To  estimate  the  number  of  MudPIT  measurements  required  for  com¬ 
pleting  as  comprehensive  an  analysis  as  possible  with  our  instrumenta¬ 
tion  and  protein  database,  we  statistically  analyzed  repeated  MudPIT 


lSidney  Kimmel  Cancer  Center.  10835  Altman  Row.  San  Diego.  California  92121.  USA.  2The  Scripps  Research  Institute.  La  Jolla.  California  92037.  USA. 
Correspondence  should  be  addressed  to  J.E.S.  (jschnitzerGskcc.org). 
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Figure  1  Quality  analysis  of  rat  lung  P  and  RLMVEC  P  by  western  blot  analysis.  Proteins  (5  pg) 
were  resolved  by  SDS-PAGE,  transferred  to  nitrocellulose  and  immunoblotted  with  antibodies  to 
the  indicated  proteins,  (a)  Western  blot  analysis  of  proteins  from  rat  lung  P  versus  RLMVEC  P  using 
antibodies  to  the  indicated  organellar  membrane  proteins,  (b)  Western  blot  analysis  of  proteins  isolated 
from  rat  lung  P  using  antibodies  for  the  indicated  cell  types.  Note  that  AQP1  has  been  found  at  the 
surfaces  of  both  ECs  and  erythocytes19.  H,  total  homogenate/lysate;  P,  silica-coated  endothelial  cell 
plasma  membranes*,  VE-cad,  vascular  endothelial  cadherin;  E-cad,  epithelial  cadherin;  FSP,  fibroblast 
surface  protein;  GlyA.  glycophorin  A;  AQP1,  aquaporin  1;  ERP72,  protein  disulfide  isomerase  A4;  58K, 
Golgi  marker  p58;  (TCOP,  coatomer  beta  subunit;  EEA1,  early  endosomal  antigen  1;  CytoC,  cytochrome 
C;  5'NT,  5'  nucleotidase;  eNOS.  endothelial  nitric  oxide  synthase;  Cavl,  caveolin  1;  WBC,  leukocyte; 
RBC,  erythrocyte;  EC,  endothelial  cell;  ER,  endoplasmic  reticulum;  lyso,  lysosome;  endo.  endosome; 
mito,  mitochondria;  PM,  plasma  membrane. 


measurements  to  determine  at  which  point  additional  experiments  pro¬ 
duce  few  newly  identified  proteins.  Using  an  identification  criterion  of 
two  or  more  MS/MS  spectra  of  true  tryptic  peptides,  we  found  that 
-66%  of  the  identified  proteins  in  any  measurement  of  rat  lung  P  were 
confirmed  by  any  second  measurement  and  34%  of  the  proteins 
remained  unconfirmed  (Fig.  2).  Adding  a  third  measurement  decreased 
newly  identified  proteins  to  20%.  After  ten  measurements,  this  number 
leveled  off  at  4.2%  (that  is,  proteins  not  found  in  the  other  nine  meas¬ 
urements).  We  use  the  term  “confidence  of  analytical  completeness”  to 
describe  this  definition  for  reproducibility.  There  were  no  differences  in 
this  statistical  curve  when  the  comparatively  analyzed  measurements 
were  from  the  same  rat  lung  P  sample  or  from  P  samples  produced  inde¬ 
pendently.  An  average  260  (±47)  unique  proteins  were  identified  in  each 
Mud  PIT  experiment  with  two  or  more  MS/MS  spectra.  The  same  95% 


Figure  2  Reproducibility  of  protein  identification  in  rat  lung  P.  The 
confidence  of  analytical  completeness  is  calculated  as  a  function  of  the 
number  of  analytical  runs.  Each  point  represents  the  percentage  of  proteins 
identified  only  in  an  individual  run  compared  to  all  preceding  experiments. 
O,  rat  lung  P*  •  RLMVEC  P. 


confidence  of  analytical  completeness  was 
achieved  for  the  RLMVEC  P  analysis  after 
seven  measurements  ( Fig.  2). 

Endothelial  cell  surface  proteome  in  lung 

In  rat  lung  P,  we  identified  450  nonredundant 
proteins  with  three  or  more  distinguishable 
MS/MS  spectra  of  true  tryptic  peptides  (see 
Supplementary  Table  1  online).  Although  pro¬ 
teins  covered  by  three  MS/MS  spectra 
accounted  for  the  largest  subpopulation,  70% 
of  all  proteins  listed  in  Supplementary  Table  1 
were  covered  by  more  than  three  MS/MS  spec¬ 
tra  (Fig.  3a).  The  distribution  of  MS/MS  spec¬ 
trum  coverage  followed  roughly  a  single 
exponential  decay  in  the  range  of  3  to  12 
MS/MS  spectra.  Exceptions  included  highly 
abundant,  highly  covered  proteins,  such  as 
fodrin,  spectrin  or  plectin. 

To  assess  possible  analytical  bias  in  the  pro¬ 
tein  identification,  we  calculated  the  molecular 
weights  and  isoelectric  points  (pis)  of  each 
protein  based  on  its  primary  amino  acid 
sequence.  All  450  proteins  identified  in  rat  lung 
_  P  showed  a  typical  molecular  weight  distribu¬ 
tion  (15-600  kDa)  with  a  maximum  of  29% 
between  25  and  50  kDa  and  >80%  below 
200  kDa  ( Fig.  3b ) .  Their  pis  ranged  from  4.0  to  1 2.0  with  bimodal  distri¬ 
bution  maxima  at  5.75  and  9.0  ( Fig.  3c).  About  25%  of  these  proteins  fall 
outside  the  typical  limits  of  protein  resolution  by  2D-gel  electrophoresis 
(<100  kDa;  pi  4.S-8.5)  (Fig.  3d). 

We  classified  the  proteins  identified  in  rat  lung  P  according  to  cur¬ 
rently  available  data.  All  proteins  fit  in  two  broad  categories  within  the 
current  rat  protein  database  ( Fig.  4a):  characterized  proteins  (74%)  and 
proteins  yet-to-be-characterized  (26%),  which  includes  entries 
described  as  genes  of  unknown  function  and  hypothetical  proteins.  Of 
the  characterized  proteins  (Fig.  4b),  81%  can  be  associated  with  the 
plasma  membrane.  The  remainder  are  reported  to  localize  primarily  to 
ribosomes  (3.6%),  endoplasmic  reticulum  (ER)  (4.5%),  mitochondria 
(5.1%)  and/or  the  nucleus  (6.3%)  and  may  include  proteins  that  may 
exist  at  more  than  one  site  in  the  cell. 

We  also  classified  the  identified  proteins  according  to  their  interaction 
with  the  plasma  membrane  as  well  as  their  orientation  and  structure  in 
the  membrane.  Of  the  proteins  residing  at  the  plasma  membrane,  31% 
were  integral  or  lipid-anchored  membrane  proteins,  25%  were 
cytoskeletal  and/or  junctional  proteins,  35%  were  inner  peripheral 
membrane  proteins  and  8.6%  were  externally  bound  ( Fig.  4c). 

Cytoskeletal  proteins  had  the  best  MS/MS  spectrum  coverage  in  our 
sample  (23.6  spectra/protein)  followed  by  inner  peripheral  membrane- 
associated  proteins  (8.9  spectra/protein).  Transmembrane  spanning  and 
lipid-anchored  proteins  had  less  coverage  (both  with  6.9  spectra/pro¬ 
tein)  (Fig.  5a).  The  majority  of  the  transmembrane  spanning  proteins 
identified  in  rat  lung  F  contained  a  single  transmembrane  domain 
(Fig.  5b).  Integral  membrane  proteins  that  are  more  hydrophobic, espe¬ 
cially  those  with  multiple  transmembrane  domains,  have  fewer  tryptic 
cleavage  sites  (hydrophilic  residues  R  and  K)  and  are  frequently  glycosy¬ 
lated,  which  may  prevent  access  of  the  protease  and  reduce  the  number 
of  possible  identifiable  tryptic  peptides.  For  example,  based  on  a  calcula¬ 
tion  of  theoretical  tryptic  digests,  G-protein-coupled  receptors  have 
only  1.25  tryptic  peptides/ 10  kDa  with  properties  likely  to  produce  good 
MS/MS  spectra  (based  on  an  evaluation  of  a2-adrenergic  receptor, 
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Figure  3  Spectrum  coverage  per  protein  and  physicochemical  properties,  (a)  Distribution  of  MS/MS  spectrum  coverage  of  proteins  identified  in  rat  lung  P. 
The  fitting  parameters  of  this  curve  have  no  physical  meaning,  (b)  Molecular  weight  distribution  of  identified  proteins,  (c)  pi  distribution  of  identified 
proteins,  (d)  The  calculated  pi  of  identified  proteins  were  plotted  against  their  calculated  molecular  weight  on  a  logarithmic  scale.  Box  indicates  typical 
limitations  for  resolving  proteins  within  2D  gels.  In  a-c,  the  number  of  proteins  falling  into  each  classification  is  plotted  (left  y-axis)  as  well  as  the  percentage 
of  the  total  number  proteins  identified  falling  into  each  classification  (right  y-axis). 


pl B  adrenergic  receptor,  muscarinic  receptor  Ml,  B1  bradykinin  recep¬ 
tor  and  angiotensin  II  receptor  1A).  Soluble  proteins  such  as  bovine 
serum  albumin  average  about  six  tryptic  peptides/ 10  kDa  of  protein 
mass.  We  identified  19  G-protein-coupled  receptors  in  lung  P,  but  only 
with  one  or  two  spectra.  Therefore,  the  relative  abundance  of  membrane 
proteins  in  the  sample  may  be  higher  than  indicated  by  our  data. 

Lastly,  we  categorized  the  identified  proteins  according  to  their 
reported  function  (Fig.  4d):  29%  and  25%  of  the  identified  proteins 
were  signaling  and  structural  proteins,  respectively;  7.8%  were  involved 
in  caveolae-  or  clathrin-mediated  cell  surface  membrane  trafficking; 
5.2%  were  involved  in  cell-cell  adhesion,  including  junctional  proteins; 


4.5%  were  exterior  enzymes,  an  important  group  known  to  be  especially 
enriched  on  the  lung  endothelium,  and  which  metabolize  blood-borne 
hormones  and  vasoactive  compounds;  4.1%  were  transporter  molecules 
mediating  the  movement  of  small  molecules  ( ions  and  water)  across  the 
membrane  bilayer;  and  25%  of  the  proteins  were  not  easily  categorized 
and  labeled  other.' 

Comparison  of  EC  surface  proteins  in  vivo  and  in  vitro 

To  assess  protein  expression  of  ECs  growing  in  native  conditions  in  vivo 
versus  in  cell  culture  in  vitro ,  we  compared  proteins  in  RLMVEC  P  ver¬ 
sus  rat  lung  P  ( Fig.  5c  and  Supplementary  Table  1  online).  We  allowed  a 
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■  Characterized  proteins  (74%) 
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■  Nucleus  (6.3%) 
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■  integral  and  lipid -anchored  plasma  membrane  (31%) 
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Figure  4  Classification  of  proteins  identified  in  rat  lung  P.  (a-d)  Pie  charts  of  the  identified  proteins  classified  according  to  their  degree  of  characterization 
in  the  rat  database  (a);  the  expected  primary  subcellular  localization  of  the  characterized  proteins  (b);  subclasses  of  plasma  membrane  proteins  (c);  and 
functional  characterization  of  the  plasma  membrane  proteins  (d). 
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Figure  5  Analysis  of  identified  proteins,  (a)  Spectrum  coverage  per 
subcellular  location.  The  average  MS/MS  spectrum  coverage  per  protein  is 
shown  for  each  subcellular  location.  Error  bars  indicate  standard  deviation; 

*  denotes  proteins  that  are  frequently  underrepresented  in  proteomics 
analysis  usually  showing  lower  spectrum  coverage  because  of  low  accessibil¬ 
ity  and  biophysical  properties,  (b)  Distribution  of  the  number  of  known  or 
predicted  transmembrane-spannmg  alpha  helices  in  all  integral  membrane 
proteins  identified  in  rat  lung  P.  The  number  of  predicted  transmembrane 
domains  per  protein  is  plotted  against  the  number  of  proteins  falling  into 
each  classification  (left  y-axis)  and  as  a  percentage  of  the  total  number  of 
integral  membrane  proteins  identified  (right  y-axis).  (c)  Localization  of 
proteins  identified  in  rat  lung  P  and  RLMVEC  P.  Histogram  of  the  number 
of  proteins  identified  in  RLMVEC  P  and  rat  lung  P  plotted  according  to  their 
subcellular  location.  Percentage  indicates  proportion  of  proteins  identified 
in  RLMVEC  P  versus  lung  P  in  each  category. 


minimum  coverage  of  one  high-quality  MS/MS  spectrum  to  identify  a 
protein  in  RLMVEC  P  as  long  as  the  protein  had  been  identified  previ¬ 
ously  with  higher  stringency  (three  or  more  spectra)  in  rat  lung  P.  We 
found  263  out  of  the  450  proteins  identified  in  rat  lung  P  also  in 
RLMVEC  P  with  the  remaining  187  proteins  (42%)  detected  only  in  rat 
lung  P,  suggesting  possible  modulation  by  the  in  vivo  tissue  microenvi¬ 
ronment  not  reproduced  in  cell  culture.  If  we  used  the  same  three  pep¬ 
tide  criteria  as  for  rat  lung  P,  only  32%  ( 145  of  450  proteins)  overlapped 
with  RLMVEC  P.  Even  greater  differences  were  apparent  in  select  cate¬ 
gories  of  proteins.  Approximately  50%  of  the  integral  and  peripheral 
membrane  proteins  as  well  as  yet-to-be-characterized  proteins  were 
solely  identified  in  rat  lung  P.  Only  35%  of  blood  and  secreted  proteins 
from  rat  lung  P  were  detected  in  RLMVEC  P.  We  identified  in  RLMVEC 
P  only  -30%  of  the  trafficking  proteins  detected  in  rat  lung  P,  consistent 
with  cultured  ECs  expressing  greater  than  tenfold  less  caveolae  relative 
to  native  ECs  in  vivo.  Proteins  primarily  localized  to  the  cytoskeleton, 
mitochondria,  ER  and  nucleus  were  identified  in  similar  proportions  in 
RLMVEC  P  and  rat  lung  P. 

High  prevalence  of  known  EC  surface  proteins 

We  identified  73  known  vascular  EC  surface  proteins  in  rat  lung  P  and 
RLMVEC  P  (see  Supplementary  Table  2  online).  Because  these  proteins 
are  known  markers  of  the  EC  surface,  we  lowered  the  stringency  for 
identification  to  one  or  more  true  tryptic  peptide  MS/MS  spectra  of 
high  quality  as  determined  by  manual  inspection.  Of  the  65  EC  marker 
proteins  identified  in  rat  lung  P,  22  (34%)  were  identified  with  one  spec¬ 
trum,  12  ( 18%)  with  two  spectra  and  31  (48%)  with  three  or  more  spec¬ 
tra.  Moreover,  41  of  the  65  proteins  (63%)  were  identified  solely  in  rat 


lung  P.  In  RLMVEC  P,  we  found  32  EC  marker  proteins  ( 12  (38%)  with 
one  spectrum,  3  (9%)  with  two  spectra  and  17  (53%)  with  three  or  more 
spectra),  of  which  8  were  not  identified  in  lung  P.  This  list  could  be 
increased  greatly  by  adding  cytoskeletal  proteins  shown  to  exist  in  ECs. 

Validation  of  protein  expression 

To  validate  proteins  identified  by  MS/MS,  we  performed  western  blot 
analysis  for  a  subset  of  22  proteins  (chosen  relatively  randomly  based  on 
antibody  availability;  Fig.  6a).  We  identified  16  of  these  proteins  by 
MS/MS  in  rat  lung  P  but  not  in  RLMVEC  P.  This  finding  was  confirmed 
by  western  blot  analysis  in  14  out  of  the  16  cases  (87.5%)  with  only  Trk- 
A  and  Lyn  B  also  detected  in  RLMVEC  P.  Nine  proteins  not  detected  in 
RLMVEC  P  were  known  EC  markers  (aquaporin,  angiotensin  convert¬ 
ing  enzyme  (ACE),  carbonic  anhydrase  (CA),  endothelin-converting 
enzyme  (ECE),  podocalyxin,  receptor  for  advanced  glycation  end  prod¬ 
uct  (RAGE),  zona  occludens  (ZO)-2,  TIE-2,  VE-cadherin)  and  four  of 
the  remaining  seven  proteins  had  either  a  predicted  transmembrane 
domain  (Trk-A,  P2Z  receptor,  seven  transmembrane  receptor)  or  a  lipid 
anchor  (LynB),  suggesting  plasma  membrane  localization.  Western  blot 
analysis  confirmed  all  five  proteins  (aminopeptidase  N  ( APN),caveolin- 
1,  HSP47,  FLJ 10849  and  ICAM-1)  identified  by  MS  in  both  rat  lung  P 
and  in  RLMVEC  P.  We  detected  5'NT  by  MS  analysis  solely  in  RLMVEC 
P  with  ten  MS/MS  spectra,  but  we  detected  it  enriched  in  P  over  H  in 
both  lung  P  and  RLMVEC  P  by  western  blot  analysis,  albeit  with  far 
more  expression  in  RLMVEC  P.  The  amount  of  5'NT  in  rat  lung  P  may 
have  been  below  the  detection  sensitivity  of  the  MS  analysis. 

This  western  blot  analysis  confirmed  the  expression  of  several  pro¬ 
teins  that  apparently  have  not  been  identified  before.  Monospecific, 
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Figure  6  Protein  validation,  (a)  Western  blot  analysis  was  performed  on  5  pg  of  total  tissue  or  cell  homogenates  (H)  and  EC  plasma  membranes  (P)  of 
rat  lung  endothelium  and  RLMVEC  using  the  indicated  antibodies,  (b)  Tissue  immunostaining  of  select  proteins  identified  in  endothelial  membranes. 
Cryosections  of  rat  lung  tissue  stained  with  antibodies  recognizing  the  indicated  proteins.  Arrowheads  denote  typical  examples  of  open  blood  microvessels. 
Bar,  10  pm.  ACE,  angiotensin  converting  enzyme;  APN,  aminopeptidase  N;  AQP1,  aquaporin  1;  CA,  carbonic  anhydrase;  ECE,  endothelm-converting 
enzyme;  HSP47,  47-kDa  heat  shock  protein;  ICAM-1,  intercellular  adhesion  molecule  1;  NHERF-2,  NaVH*  exchange  regulatory  cofactor  NHE-RF2;  P2X7, 
P2X  purinergic  receptor  7;  Podo,  podocalyxin;  RAGE,  receptor  for  advanced  glycation  end  products;  STR,  seven  transmembrane  receptor;  VE-cad,  vascular 
endothelial-cadherin;  ZO-2,  zona  occludens  2;  cavl,  caveolin-1. 


affinity-purified  polyclonal  antibodies  that  we  generated  to  synthetic 
peptides  confirmed  not  only  our  MS  findings  but  also  the  existence  of 
two  proteins  listed  as  hypothetical  in  the  database.  The  antibodies 
detected  a  59-kDa  protein  (FLJ 10849)  in  both  rat  lung  P  and  RLMVEC 
P  and  an  82-kDa  protein  (HSPA12B)  only  in  rat  lung  P,  consistent  with 
the  predicted  molecular  weight  of  these  proteins.  Other  verified  new 
findings  included  the  enrichment  in  P  of  TIE-2,  APN,  ICAM-1  and 
nucleolin,  whose  expression  at  the  EC  surface  was  reported  to  be 
induced  during  angiogenesis  and/or  inflammation  (TEM4  and  TEM610 
were  also  identified  but  no  antibodies  are  available  for  validation).  Two 
proteins,  P2Z7  and  Trk-A,  have  not  previously  been  localized  to  ECs  but 
rather  to  neuronal  cells  and  tissues. 

We  also  used  immunohistochemical  staining  of  rat  lung  tissue  sec¬ 
tions  to  validate  EC  expression  of  23  proteins  (chosen  randomly  by 
availability  of  antibodies  suitable  for  immunohistochemistry).  ACE, 
CD26,  HSPA12B,  NHERF-2,  P2X7,  Pincher,  PV-1,  podocalyxin, 
PECAM  (CD31)  and  OX-45  were  expressed  on  lung  blood  vessels 
( Fig.  6b).  Caveolin- 1,  ECE,  aquaporin  1 ,  Nedd5,  CA,  TIE2,  RAGE,  Glut4, 
ZO-1,  ICAM-1,  thrombomodulin,  CD39  and  aminopeptidase  P  were 
similarly  detected  on  lung  endothelium  (data  not  shown).  Tissues 
stained  with  secondary  antibody  alone  were  negative.  Thus,  these  analy¬ 
ses  together  validated  27  proteins  newly  identified  on  the  lung  endothe¬ 
lial  cell  surface  through  our  MS  analysis  of  rat  lung  P  (e.g.,  Trk-A,  P2X7, 
seven  transmembrane  receptor,  NHERF-2,  HSPA12B,  pincher,  Nedd  3, 
Glut4).  See  Supplementary  Notes  online  for  an  expanded  description  of 
findings  and  validation. 

DISCUSSION 

Our  estimation  of  the  confidence  of  completeness  provides  a  useful 
gauge  of  the  reproducibility  and  relative  comprehensiveness  of  individ¬ 
ual  MudPIT  measurements  as  well  as  the  number  of  measurements 
required  to  achieve  a  statistically  defined  level  of  completeness.  The 
probability  that  the  acquisition  of  tandem  mass  spectra  will  be  exactly 
reproduced  is  inversely  proportional  to  the  complexity  and  dynamic 
range  of  the  peptide  mixture.  Likewise,  reproducibility  of  the  data 
depends  on  the  complexity  of  the  sample  as  well  as  the  quality  and  com¬ 
position  of  peptides  from  independent  sample  preparations  originating 
from  different  animals.  Thus,  it  seems  prudent  to  do  a  statistical  analysis 


from  several  measurements  to  assess  this  variation  and  to  estimate 
the  degree  to  which  the  sample  can  be  comprehensively  described  with 
a  given  amount  of  independent  measurements.  Our  analysis  here 
required  7-10  MudPit  measurements  to  achieve  >95%  confidence  of 
analytical  completeness  possible  with  the  current  equipment,  database 
and  analytical  approach.  Note  that  the  confidence  of  analytical 
completeness  may  depend  on  many  factors — including  the  complexity 
of  the  individual  sample  being  analyzed,  sample  preparation,  the  quality 
of  the  chromatography,  the  sensitivity  of  the  instrument  and  the 
completion  of  the  rat  protein  database — and  must  be  reevaluated  for 
each  project. 

Based  on  the  current  rat  database,  74%  of  the  proteins  identified  in 
the  isolated  EC  surface  have  been  characterized  to  some  extent,  includ¬ 
ing  functionally,  structurally  and/or  with  localization  data.  But  26%  are 
new  and  not  yet  categorized  in  the  rat  database.  Based  on  the  current  lit¬ 
erature,  >80%  of  the  characterized  proteins  have  a  solid  reason  or  exper¬ 
imental  evidence  supporting  their  association  with  the  plasma 
membrane.  Yet,  despite  rigorous  quality  testing  of  each  membrane 
preparation  by  western  blot  analysis,  we  have  identified  some  proteins 
that  at  first  glance  may  not  appear  to  belong  at  the  plasma  membrane, 
including  blood  proteins  and  proteins  of  intracellular  organelles. 

Consistent  with  flushing  the  vasculature  free  of  blood,  our  quality 
testing  detects  little  to  no  presence  of  well-known  marker  proteins  for 
erythrocytes  and  leukocytes  in  P.  Nevertheless,  we  identified  21  known 
blood  proteins.  Given  that  there  are  >500  blood  plasma  proteins  and 
even  more  blood  cell  proteins,  this  number  appears  too  low  to  be 
explained  by  general  contamination.  In  fact,  the  presence  of  a  small  sub¬ 
set  of  blood  proteins  may  be  expected  in  rat  lung  P  and  RLMVEC 
(grown  in  serum).  For  example,  albumin  and  transferrin  can  interact 
specifically  with  the  EC  surface  via  specific  binding  proteins22-24. 
Hemoglobin  alpha  and  beta  chains  may  come  from  small  amounts  of 
cell-free  hemoglobin  in  the  blood  rather  than  simply  erythrocyte  con¬ 
tamination,  especially  in  light  of  the  lack  of  erythrocyte  markers  in  P 
(Fig.  1). 

Likewise,  we  detected  proteins  currently  considered  to  be  located  pri¬ 
marily  in  ER,  mitochondria,  Golgi,  ribosomes  or  nuclei,  which  cumula¬ 
tively  were  -15%  of  the  total  proteins  identified  (20%  of  the 
characterized  proteins).  But,  given  our  stringent  quality  criteria  of 
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>20-fold  enrichment  of  EC  markers  and  >20-fold  depletion  of  specific 
markers  of  these  subcellular  organelles,  we  may  expect  <5%  contami¬ 
nants.  Thus,  general  contamination  of  membranes  from  these 
organelles  seems  remote,  even  more  so  if  one  considers  the  few  some¬ 
times  low-abundance  proteins  identified  (-15  average  per  organelle) 
relative  to  the  many  hundreds  of  proteins  in  each  organelle.  Of  course,  it 
is  possible  that  polycation  ic  proteins,  such  as  histones  and  ribosomal 
subunit  proteins,  released  by  tissue  homogenization  can  adsorb  electro¬ 
statically  to  the  polyanionic  polyacrylic  acid  crosslinker  of  the  silica- 
coated  membranes.  But  it  seems  more  likely  that  most  of  these  proteins 
simply  reside  in  more  than  one  subcellular  location.  It  is  not  unusual  for 
proteins  initially  categorized  as  organelle-specific  to  later  be  discovered 
elsewhere  in  the  cell.  So  far,  four  proteins  identified  here  are  examples  of 
this.  Nucleolin  is  a  well-characterized  nuclear  protein  but  is  expressed 
also  at  the  cell  surface25.  Elongation  factor- 1  alpha  is  known  primarily  as 
a  component  of  the  protein  synthesis  machinery  but  has  a  less  well- 
known  function  in  cytoskeletal  reorganization26.  Protein  disulfide  iso- 
merase,  commonly  classified  as  an  ER  protein,  has  recently  been 
detected  at  the  plasma  membrane  of  platelets  and  liver27.  Hsp47  was  ini¬ 
tially  assigned  to  the  ER,  but  later  discovered  to  be  transported  to  the 
plasma  membrane28.  Thus,  this  analysis  may  provide,  for  some  proteins, 
the  first  evidence  of  a  secondary  localization  at  the  plasma  membrane. 
Protein-protein  interactions  may  further  explain  the  presence  of  select 
proteins  (see  Supplementary  Note  online).  Considering  all  of  these 
issues,  our  list  of  identified  proteins  seems  rather  consistent  with  our 
quality  control  data  showing  >95%  purity  of  the  isolated  membranes. 

Comparative  proteomic  analysis  of  EC  surface  membranes  isolated 
from  rat  lung  versus  cultured  RLMVEC  revealed  striking  differences. 
Only  -51%  of  the  integral  membrane  proteins  and  plasma  mem¬ 
brane-associated  proteins  identified  were  expressed  in  common 
between  rat  lung  P  and  RLMVEC  P.  Interestingly,  65  of  73  (89%)  total 
known  EC  marker  proteins  detected  in  this  study  were  found  in  rat  lung 
P  versus  only  32  (43.8%)  in  RLMVEC  P  (see  Supplementary  Table  2 
online).  We  detected  41  known  markers,  such  as  ACE  and  ECE,  in  rat 
lung  P  but  not  in  RLMVEC  P.  Overall,  more  than  180  (41%)  proteins 
were  detected  in  rat  lung  P  in  vivo  but  not  at  all  in  RLMVEC  P  in  vitro 
(see  Supplementary  Note  online  for  additional  discussion). 

It  is  quite  apparent  through  studying  vasculogenesis,  angiogenesis 
and  the  modulation  of  EC  phenotype  in  culture  and  in  vivo  that  EC 
morphology,  protein  expression  and  function  are  affected  greatly  by 
physical  and  molecular  cues  from  the  basement  membrane,  perivascular 
cells,  circulating  blood  and  even  other  cells  deeper  within  the  tis- 
sueU0.i7,29  isolation  as  well  as  standard,  albeit  limited,  cell  culture 
conditions  seem  not  to  maintain  the  expression  of  proteins  found  in 
vivo ,  which  may  have  functional  consequences.  Here,  we  provide  a 
glimpse  of  how  wide  the  microenvironment-induced  molecular  differ¬ 
ences  between  ECs  in  vivo  and  in  vitro  can  be.  Of  course,  some  of  these 
differences  may  relate  to  the  derivation  of  the  RLMVEC  from  one  seg¬ 
ment  of  the  pulmonary  microvasculature  (arterioles,  capillaries  and 
venules)  whereas  rat  lung  P  is  primarily  EC  plasma  membranes  from  all 
microvessel  segments,  which  constitute  >90%  of  the  total  vascular  sur¬ 
face  area.  Obviously,  more  sophisticated  culturing  conditions  are 
needed  to  maintain  proteins  expressed  in  native  tissue.  Towards  this  end, 
our  results  begin  to  identify  a  set  of  promising  tissue- modulated  EC 
markers  that  can  now  be  used  to  monitor  future  attempts  to  reproduce 
native  EC  phenotype  in  vitro  and  may  contribute  to  a  deeper  under¬ 
standing  of  how  the  tissue  environment  controls  EC  phenotype. 

We  have  described  a  large-scale  proteomic  analysis  of  in  vivo  vascular 
endothelium — more  specifically,  the  luminal  EC  surface  in  lung  tissue. 
Our  identification  of  450  proteins  using  rigorous  criteria  (more  than 
three  peptides)  is  an  improvement  over  many  proteomic  analyses  that 


found  similar  numbers  of  proteins  in  whole  cells,  bacteria,  yeast  and  tis¬ 
sue  preparations  but  usually  with  less  demanding  selection  criteria  (one 
to  two  peptides)30.  Moreover,  two  MS  analyses  of  whole  cultured  ECs 
recently  identified  less  than  60  proteins,  including  only  a  few  proteins 
found  here:  hemoglobin,  albumin,  heat  shock  proteins,  actin,  nucle- 
ophosmin,  vimentin  and  tubulins' U2.  These  studies  did  not  identify 
any  cell  surface  enzymes  or  transmembrane  proteins. 

A  more  detailed  understanding  of  the  proteome  of  specialized  cell 
types  may  help  elucidate  how  cells  fulfill  their  biological  function  in  a 
systems  biological  context.  Differences  in  protein  expression  patterns 
between  differentiated  cells  as  well  as  changes  occurring  during  disease 
are  of  great  interest  in  experimental  medicine  for  diagnosis  and  treat¬ 
ment.  Cell  surface  mapping  of  normal  lung  endothelium  in  vivo  and  in 
vitro  is  a  necessary  prelude  to  understanding  the  extent  to  which  the  tis¬ 
sue  microenvironment  can  modulate  EC  function  and  protein  expres¬ 
sion  not  only  in  normal  tissues  but  also  perhaps  in  diseases.  This 
specialized  proteome  may  have  value  clinically  because  proteins 
expressed  at  this  blood-tissue  interface  are  accessible  to  the  circulating 
blood  and  may  serve  as  targets  for  tissue-specific  drug  and  gene  delivery 
in  vivo29*31,32.  The  technology  described  here  may  help  exploit  such  pos¬ 
sibilities  in  the  future. 

METHODS 

Materials.  Standard  laboratory  chemicals  were  high-performance  liquid  chro¬ 
matography  (HPLC)  grade  or  higher  if  available  and  purchased  from  VWR 
unless  otherwise  noted.  Tris  (2-carboxyethyl)-phosphine  hydrochloride  (TCEP) 
was  obtained  from  Pierce,  iodoacetamide  was  obtained  from  Aldrich.  Porozyme 
bulk-immobilized  trypsin  was  purchased  from  Applied  Biosystems. 
Endoproteinase  Lys-C  was  purchased  from  Roche  Diagnostics.  3x  cell  lysis  buffer 
(CLB)  contains  6  M  Urea,  0.5  M  Trisbase,9  mM  EDTA,  3.6%  p-mercaptoethanol 
and  9%  SDS. 

We  obtained  antibodies  against  the  following:  ACE,  APN,  caveolin-l, 
E-cadherin,  Tie  2,  PECAM  and  VE-cadherin  from  Santa  Cruz  Biotechnology: 
aquaporin-l,  cytochrome  C,  ERp72,  EEA1,  eNOS,  Lamp!,  Lyn,  P2X7,  Ran, 
Transports,  CD26  and  ZO-2  from  BD  Biosciences/Pharmingen;  (i-COP,  fibrob¬ 
last  surface  protein,  glycophorin  A  and  p38K  from  Sigma;  CD4  and  GDI  I  from 
Serotech;  HSP47  from  Stressgen  Biotechnologies;  RAGE  from  Affinity 
Bioreagents;  ECE  from  Zymed  Lab;  Trk-A  from  ABcam  nucleolin  from  Leinco 
Technologies.  Antibodies  against  the  following  were  kind  gifts:  carbonic  anhy- 
drase  IV  from  W.S.  Sly,  St.  Louis  University;  NH ERF-2  from  E.J.  Weinman, 
University  of  Maryland  Hospital33;  5'  nucleotidase  from  Paul  Luzio  (Cambridge, 
England);  seven  transmembrane  protein  (Ig-hepta)  from  Shigehisa  Hirose, 
Tokyo  Institute  of  Technology  (Yokohama,  Japan)34;  OX -45  from  Neil  Barclay, 
University  of  Oxford  (Oxford,  UK)35;  Pincher  from  Simon  Haleguola,  State 
University  of  New  York  (Stony  Brook,  NY);  podocalyxin,  PV-I  and  ICAM-I  were 
produced  in  house. 

Animals.  Sprague- Da wley  male  rats  (150-250  g;  Charles  River  Laboratories) 
were  used  unless  otherwise  indicated.  Animal  experiments  were  done  in  accor¬ 
dance  with  federal  guidelines  following  review  and  approval  by  the  Institutional 
Animal  Care  and  Use  Committee. 

Cell  culture.  Rat  lung  microvascular  ECs  (RLMVEC)  were  isolated  and  grown  as 
described  originally36. 

SDS  page  and  western  blot  analysis.  SDS  PAGE  and  western  blot  analysis  were 
done  as  described  previously18. 

Tissue  immunostaining.  Frozen  rat  lung  tissue  was  cut  (5  pm)  on  a  Microm 
HM505E  cryomicrotome.  Sections  were  fixed  with  neutral  buffered  formalin  for 
5  min  at  25  °C  then  incubated  for  1  h  at  25  °C  in  blocking  solution  (5%  FBS,  0.1% 
Tween  20  in  PBS).  After  a  2-h  incubation  at  room  temperature  in  primary  anti¬ 
bodies  (diluted  in  blocking  solution),  the  sections  were  washed,  treated  with 
the  biotin-conjugated  secondary  antibody  (KPL  Laboratories)  for  1  h  at  25  °C, 
washed  again  and  treated  with  a  streptavidin-conjugated  horseradish  peroxidase 
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(KPL  Laboratories)  for  1  h  at  25  °C.  Immune  complexes  were  detected  using  a 
Liquid  DAB  staining  kit  from  BioGenex.  Sections  were  counterstained  with 
hematoxylin,  dehydrated,  mounted  in  Permount  (Fisher  Scientific)  and  imaged 
digitally  by  light  microscopy  using  a  Nikon  Eclipse  E800  equipped  with  a  Nikon 
digital  camera  DXM 1 200. 

Production  of  polyclonal  antibodies.  Antigenicity  predictions  for  HSPA12B  and 
septin  homolog  FL|  10849  were  done  with  Lasergene  (DNAStar)  using  the  predic¬ 
tion  methods  of  Margalite  and  Berzofsky,  Rothbard  and  Taylor  and  Jameson 
Wolf.  Polyclonal  antibodies  in  rabbit  were  produced  for  HSPA12B  against  the 
peptide  Ac-CGYTARDYYH DLDPEEAR-CON H:  and  for  FLJ10849  against  the 
peptide  Ac-CQLLQSQAQQSGAQQTKKD-CONH,  by  Biosource. 

Isolation  of  luminal  EC  plasma  membranes  from  rat  lungs  and  RLMVEC.  The 
luminal  EC  plasma  membranes  were  isolated  directly  from  rat  lung  tissue  or  cell 
culture  using  a  silica-coating  procedure  as  described1 5,1 6.  Briefly,  a  solution  of 
cationic  colloidal  silica  particles  was  used  to  selectively  coat  the  EC  surface  either 
by  perfosing  the  lung  vasculature  via  the  pulmonary  artery  in  situ  or  by  overlay¬ 
ing  RLMVEC  confluent  monolayers.  After  electrostatic  cross-linking  and 
quenching  with  polyacrylic  acid  to  form  a  stable  membrane  pellicle,  the  tissue  or 
cells  were  homogenized  and  the  sheets  of  P  are  sedimented  by  two  rounds  of 
ultracentrifogation  through  high-density  media  away  from  all  the  other  much 
less  dense  tissue  component  in  the  lung  homogenate  or  the  total  RLMVEC  lysate 
(H).  This  subfractionation  yielded  a  membrane  pellet  highly  enriched  in  EC 
surface  markers  and  markedly  depleted  of  markers  of  other  cell  types  and  subcel- 
lular  organelles1  s~19. 

To  ensure  that  we  use  consistently  high-quality  material,  each  lot  of  H  and 
isolated  P  was  subjected  to  rigorous  quality  control  testing  before  analysis. 
Using  standard  immunoblotting,  we  required  the  purity  level  of  plasma  mem¬ 
branes  in  P  relative  to  H  to  be  at  least  20-fold  enriched  in  at  least  two  EC  surface 
marker  proteins,  such  as  caveolin,  5'-nucleotidase  (5'NT),  VE-cadherin-1, 
angiotensin  converting  enzyme  (ACE)  or  endothelial  nitric  oxide  synthase 
(eNOS).  In  addition,  P  had  to  be  depleted  by  >20-fold  relative  to  H  in  well- 
established  organellar  membrane  markers,  including  nuclear  (ran,  transport  in), 
endosomal  (early  endosome  antigen  1  (EEA1 )),  Golgi  (p58,  (3-COP),  lysosomal 
(lampl),  mitochondrial  (cytochrome  c)  and  endoplasmic  reticulum  (ERp72). 
For  rat  lung  P,  we  also  required  a  >20-fold  depletion  in  protein  markers  of 
membranes  from  other  cell  types  present  in  lung  or  blood,  including  quthelial 
cells  (E-cadherin),  fibroblasts  (FSP),  white  blood  cells  (CD1I,  CD4)  and  red 
blood  cells  (glycophorin  A). 

Preparation  of  enzymatic  digest  of  protein  mixtures  for  mass  spectrometric 
analysis.  The  proteins  in  P  were  solubilized  using  CLB  before  vortexing  and  boil¬ 
ing  for  5  min.  The  protein  concentration  was  determined  by  the  micro  BCA  assay 
from  Pierce.  Silica  particles  were  sedimented  by  centrifugation  to  collect  the  solu¬ 
bilized  proteins  in  the  supernatant,  which  was  then  precipitated  with  chloro¬ 
form/methanol  and  resolubilized  in  8  M  urea  adjusted  to  pH  8.5  with  100  mM 
ammonium  bicarbonate.  Protein  disulfide  bonds  were  reduced  with  2.5  mM  Tris 
(2-carboxyethyl)-phosphine  hydrochloride  (TCEP)  for  15  min  at  60  °C;  car- 
boxyamidomethylated  with  3.75  mM  iodo  acetamide  for  15  min  at  25  °C  in  the 
dark;  digested  by  a  first-site-specific  enzymatic  digestion  with  endoproteinase 
lys-C  for  4  h  at  37  °C,  1: 100  (wt/wt);  digested  for  a  second  time.  After  the  protein 
mixture  was  diluted  to  1 .6  M  urea  with  100  mM  ammonium  bicarbonate,  pH  8.5, 
it  was  digested  a  second  time  with  poroszy me- immobilized  trypsin  ( 10  pi/ 1 00  jig 
protein)  overnight  at  37  °C  (with  agitation).  The  immobilized  trypsin  was  then 
removed  by  centrifugation  and  the  peptides  in  the  supernatant  concentrated  by 
solid  phase  extraction  with  SPEC-PLUS  PTC  1 8  cartridges  (Ansys  Diagnostics) 
following  the  manufacturers  instructions.  After  lyophilizing  the  peptide  mixture 
to  near  dryness  the  samples  were  stored  in  sample  buffer  (5%  acetonitrile,  0.3% 
formic  acid)  at  a  concentration  of  5  pg/pl  (calculated  from  the  initial  amount  of 
protein)  at  -20  °C  before  mass  spectrometric  analysis. 

MudPIT.  In  each  experiment  approximately  150  jig  of  complex  peptide  mixture 
was  separated  by  2D  LC.  The  first  dimension  was  based  on  strong  cation 
exchange  and  the  second  dimension  on  C,8  reversed-phase  hydrophobic  interac¬ 
tion  chromatography.  Both  chromatographic  materials  were  packed  seamlessly 
into  a  frittless  microcolumn.  A  5-pm  tip  was  pulled  on  a  fused-silica  microcapil¬ 


lary  ( 100  pm  inner  dimension  (i.d.)  x  365  pm  outer  dimension  (o.d.))  using  a 
Model  P-2000  laser  puller  (Sutter  Instrument).  The  microcolumn  was  then 
packed  with  three  phases  of  chromatographic  material  as  follows:  8.5  cm  of  5  pm 
C)8  reversed-phase  material  (Polaris  C18-A,  Metachem),  then  4  cm  of  5  pm, 
300-A  strong  cation  exchanger  (PolyLC)  and  lastly  3.5  cm  of  C,8  material  using  a 
helium  pressure  cell  operated  at  600-900  p.s.i.  (Mass  Evolution).  After  equilibrat¬ 
ing  the  microcapillary  column  with  sample  buffer,  30  pi  of  the  complex  peptide 
mixture  ( 1 50  pg)  was  loaded  off-line  onto  the  microcapillary'  column  which  was 
then  mounted  onto  a  microcross  (Upchurch  Scientific)  and  placed  in-line  with 
an  Agilent  1 100  quaternary  HPLC.  Peptides  were  directly  eluted  into  the  mass 
spectra  meter  using  2D  LC  with  1 8  step-elutions  from  the  strong  cation  exchanger 
followed  by  a  gradient  elution  of  the  reversed-phase  material  '0.  The  buffer  solu¬ 
tions  used  were:  buffer  A,  5%  (vol/vol)  acetonitrile  and  0.1%  (vol/vol)  formic 
acid;  buffer  B,  80%  (vol/vol)  acetonitrile  and  0.1  %  (vol/vol)  formic  acid;  buffer 
C,  500  mM  ammonium  acetate,  5%  (vol/vol)  acetonitrile  and  0.1%  (vol/vol) 
formic  acid.  Step  1  consisted  of  a  1 10-min  gradient  from  0  to  100%  buffer  B  and 
was  followed  by  steps  2-18.  Each  step  consisted  of  a  135-min  gradient  including 
pregradient  salt  steps  with  increasing  concentrations  of  ammonium  acetate.  The 
profile  of  steps  2-18  was:  5  min  100%  buffer  A;  3  min  x%  buffer  C  (x%  corre¬ 
sponding  to:  25,37.5,50,58.5,67,75,83.5,91.5, 100, 125, 150, 175,200, 225,250 
and  500  mM  ammonium  acetate);  5  min  1 00%  buffer  A;  a  5-min  gradient  to  15% 
buffer  B;  a  60-min  gradient  to  45%  buffer  B;  a  32-min  gradient  to  100%  buffer  B; 
5  min  100%  buffer  B;  a  10-min  gradient  to  100%  buffer  A;  and  10  min  isocratic 
re-equilibration  at  100%  buffer  A.  The  100  pl/min  flow  rate  at  the  HPLC  was 
reduced  400-fold  by  a  fused  silica  capillary  (50  pm  i.d.,  365  pm  o.d.)  splitter 
placed  on  the  microcross. 

Mass  spectrometric  measurements  were  done  on  a  LCQ  Deca  XP  ion  trap 
mass  spectrometer  (ThermoFinnigan)  equipped  with  a  modified  microelectro¬ 
spray  ionization  source  from  Mass  Evolution.  A  spray  voltage  of  2.0  kV  was 
applied  precolumn  at  the  liquid  junction  of  the  microcross  as  described'7. 
Operation  of  the  quaternary  Agilent  1 100  HPLC  pump  and  the  mass  spectrome¬ 
ter  was  fully  automated  during  the  entire  procedure  using  the  Excalibur  1.2  data 
system  (ThermoFinnigan).  Continuous  cycles  of  one  full  scan  (m/z  400  to  1 400) 
followed  by  three  data-dependent  MS/MS  measurements  at  35%  normalized  col¬ 
lision  energy  were  done.  MS/MS  measurements  were  allowed  for  the  three  most 
intense  precursor  ions  with  an  enabled  exclusion  list  of  25  m/z  values  (±1 .5  Da)  or 
a  maximum  time  limit  of  5  min.  The  zoom  scan  function  to  determine  the  charge 
state  was  disabled  to  increase  the  duty  cycle  of  the  instrument. 

Database  search  and  analysis  of  tandem  mass  spectra.  MS/MS  spectra  were 
extracted  from  raw  files  requiring  a  minimum  of  2 1  signals  with  an  intensity  of  at 
least  7.5  x  104  arbitrary  units.  Extracted  MS/MS  spectra  were  automatically 
assigned  to  the  best  matching  peptide  sequence  using  the  SEQUEST  algorithm'8 
and  the  Sequest  Browser  software  package  (ThermoFinnigan).  SEQUEST 
searches  were  done  on  independent  dual  processor  (2.0-2.6  GHz)  personal  com¬ 
puters  against  a  rat  protein  database  containing  40,800  protein  sequences  down¬ 
loaded  as  FASTA-formatted  sequences  from  Entrez  (NCBI;  http://www.ncbi.nlm. 
nih.gov/entrez).  Sequence  redundancies  were  removed  using  Perl  script.  To 
increase  the  search  speed  the  protein  database  was  preprocessed  to  create  a  binary 
database  containing  all  possible  tryptic  peptides  of  the  searched  database.  Any 
static  and  dynamic  modifications  that  would  account  for  post-translational  and 
chemical  modifications  were  neglected.  The  peptide  mass  search  tolerance  was 
set  to  3  Da.  Spectral  matches  were  retained  with  a  minimal  cross-correlation 
score  (XCorr)  of  1.5,  2.2  and  3.3  for  charge  states  +1,  +2  and  +3  respectively. 
DeltaCN  (top  match’s  XCorr  minus  the  second-best  match’s  XCorr  divided  by 
top  match’s  XCorr)  had  to  be  >0.07.  Retained  spectral  matches  were  filtered  and 
reassigned  to  proteins  using  DTASelect'9.  DTASelect  outputs  of  indejXMident 
measurements  were  entered  into  Accessible  Vascular  Targets  database  (AVATAR). 
AVATAR  was  designed  to  store  a  large  amount  of  mass  spectrometric  data  and  to 
provide  tools  to  analyze  the  data  to  extract  valuable  information.  We  used  rela¬ 
tional  models  for  database  design  based  on  Entity-Relationship  and  implemen¬ 
ted  the  database  in  the  MySQL  relational  database  management  system  to  support 
database  query  and  management.  This  relational  database  plus  Perl-based  user- 
friendly  interface  have  greatly  improved  data  organization,  data  consistency  and 
integrity,  and  facilitated  data  comparison  and  information  retrieval.  To  increase 
coverage  of  known  EC  markers  in  some  of  the  analysis,  we  also  used  a  larger  data¬ 
base  containing  over  200,000  protein  sequences  (human,  mouse  and  rat). 
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Removal  of  protein  redundancy.  AVATAR  allowed  a  detailed  analysis  of  parame¬ 
ters  such  as  reproducibility  and  combined  sequence  coverage.  The  final  protein 
list  created  from  all  peptide  sequences  obtained  by  MS/MS  was  compiled  using 
AVATAR  to  retrieve  the  minimum  number  of  proteins  identified  by  three  or  more 
peptide  spectra.  If  multiple  entries  of  the  same  protein  from  different  species 
(that  is,  mouse  and  human)  other  than  rat  were  present  only  the  species  with  the 
better  spectrum  coverage  was  manually  retained.  If  it  was  not  unambiguously 
possible  to  distinguish  multiple  isoforms  or  splice  variants  of  a  protein  being 
assigned  to  the  same  set  of  peptides  from  each  other,  we  manually  reduced  this 
list  to  one  primary  entry. 

Analysis  of  the  reproducibility  of  MudPIT  measurements.  Twelve  independent 
18-step  MudPIT  experiments  of  rat  lung  P  sample  were  statistically  analyzed  to 
evaluate  the  benefit  of  repetitive  MudPIT  measurements.  After  each  additional 
experiment  we  determined  the  fraction  of  proteins  that  were  represented  in  at 
least  two  experiments  and  the  percentages  of  proteins  that  were  present  in  only 
one  single  experiment.  Because  this  analysis  can  depend  on  the  sequence  by 
which  the  experiments  are  analyzed,  we  limited  potential  bias  from  any  particular 
order  of  experiments  by  analyzing  12  different  experiment  sequences  where  the 
order  of  the  experiments  was  systematically  permutated  by  shifting  the  first 
experiment  to  the  last  position. 

In  silico  protein  analysis.  The  molecular  weights  and  pis  of  each  protein  based  on 
its  primary  amino  acid  sequence  were  calculated  using  the  Expasy  web  server 
(http^/us.expasy.org/tools/pi_tool.html).  The  actual  molecular  weights  and  pis 
may  differ  slightly  from  the  calculated  values  because  post-translational  modifi¬ 
cations  are  not  considered  by  the  program  at  this  time.  SwissProt 
(http://us.expasy.org/sprot/sprot-top.html)  and  the  National  Center  Biotech¬ 
nology  Information  (NCBI;  http://www.ncbi.nlm.nih.gov/entrez/query.fcgi) 
protein  and  literature  databases  were  used  to  classify  proteins  according  to  their 
currently  known  predominant  subcellular  location.  Known  or  predicted  trans¬ 
membrane-spanning  alpha  helices  were  determined  through  the  literature  or  by 
using  the  web -based  prediction  program  TMHMM  v2.0(provided  by  the  Center 
for  Biological  Sequence  Analysis  of  the  Technical  University  in  Denmark, 
http^/www.cbs.dtu.dk/services/TMHMM-2.0).  Only  100%  probabilities  were 
taken  into  consideration.  Searches  for  protein  families  and  hidden  Markov  mod¬ 
els  were  done  using  pfam  (http://www.sanger.ac.uk/software/pfam/index.html). 

Note:  Supplementary  information  is  available  on  the  Nature  Biotechnology  website. 
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Diversity  of  identified  proteins.  Proteins  associated  with  the  plasma  membrane  that  are  not  integral 
membrane  proteins  or  lipid-anchored  may  exist  in  our  preparation  through  protein  complex  formation 
and  interaction  partners.  For  example,  sodium  hydrogen  exchanger  regulatory  factor  2  (NHERF-2)  is  not 
membrane-bound  but  rather  associates  with  podocalyxin,  a  pan-endothelial  integral  membrane  protein 
marker1,  via  its  PDZ  domain2.  NHERF-2  in  turn  interacts  with  ezrin  and  EBP50-PDZ  interactor3,  both  of 
which  were  also  identified  in  rat  lung  P.  Another  example  is  dedicator  of  cytokinesis  protein  I 
(Dock  1 80),  which  is  recruited  to  the  plasma  membrane  via  its  interaction  with  phosphatidylinositol 
3,4,5-triphosphate4  where  it  has  been  implicated  in  regulation  of  cytoskeletal  rearrangements  required 
for  cell  motility5.  Moreover,  its  activation  has  been  found  to  be  enhanced  by  binding  to  engulment  and 
cell  motility  proteins  (ELMO)  1  and  2,  also  found  in  P.  Such  similar  interactions  may  explain  the 
presence  of  many  cytosolic  and  even  other  proteins  in  our  preparation  because  of  yet  to  be  discovered 
recruitments. 

Distinct  EC  surface  proteome  in  vivo  vs.  in  vitro.  The  tight  junction  protein  ZO-2  is  readily  detected 
by  MS  in  rat  lung  P  but  not  in  RLMVEC.  Western  analysis  (Fig.  6a)  and  immunofluorescence  (our 
unpublished  observations)  confirms  the  expression  of  ZO-2  in  RLMVEC  cells.  It  is  in  the  total  cell 
lysate  (H)  and  not  P.  The  lack  of  ZO-2  and  other  junctional  complex  proteins  (VE-cadherins  and  ZO-1) 
in  RLMVEC  P  but  not  rat  lung  P  suggest  that  the  junctional  complex  remains  in  P  from  tissue  but  not 
cultured  EC  monolayers.  The  intercellular  junctions  of  ECs  in  vivo  are  more  developed  and  tighter  than 
those  in  culture  so  that  during  homogenization  of  the  lung  tissue,  membrane  breakage  may  occur  to 
include  along  with  the  rest  of  the  silica-coated  plasma  membranes  only  the  junctions  that  are  tightly 
interdigitated. 

Novel  findings.  In  addition  to  identifying  multiple  known  plasma  membrane  and  EC  marker  proteins, 
we  also  detected  a  number  of  proteins  that  were  unexpected,  such  as  TIE-2,  APN,  TEM4,  TEM6, 
ICAM-I,  and  nucleolin.  TIE2  and  ICAM-1  expression  in  lung  endothelium  was  also  validated  by  tissue 
immunostaining  (Fig.  6b)  whereas  APN  and  nucleolin  were  detected  by  Western  analysis  enriched  in 
lung  P.  These  molecules  are  all  thought  to  be  endothelial  markers  induced  during  angiogenisis  and/or 
inflammation  despite  being  clearly  present  in  rat  lung  P  from  multiple  samples.  Given  the  mechanical 
forces  involved  with  respiration,  lung  endothelium  may  undergo  recurrent  damage  requiring  more  self¬ 
renewal  and  be  less  quiescent  than  in  other  tissues.  Conversely,  the  inherent  highly  oxygenated  state  of 
the  tissue  would  render  unlikely  typical  triggers  of  angiogenesis,  such  as  hypoxia.  In  the  end,  strategies 
of  targeting  tumors  through  these  proteins  should  assess  possible  lung  targeting  in  vivo. 
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We  also  identified  two  neuronal  receptors,  P2Z  receptor  (P2X7)6  and  Trk-A7.  Although  Trk-A  (isoform 
I)  can  be  expressed  in  a  wide  range  of  non-neuronal  tissues  such  as  connective  tissue  (fibroblasts), 
kidney,  lung,  breast  and  esophagus8,9,  to  the  best  of  our  knowledge  this  is  the  first  time  that  either  of 
these  neuronal  cell  surface  receptors  are  detected  at  the  EC  surface  (Fig.  6).  Given  the  enrichment  in  P 
and  the  lack  of  previous  detection  in  lung  tissue6,  their  EC  surface  expression  appears  real  and  may 
reflect  the  high  sensitivity  of  this  analytical  approach.  Other  proteins,  such  as  aquaporin  and 
synaptobrevin,  previously  thought  to  be  specific  for  other  cell  types  (epithelial  and  neuronal, 
respectively)  have  also  been  discovered  in  endothelium  using  similar  tissue  subfractionation 
techniques10  ".  The  P2X7  andtibody  is  effective  for  both  Western  analysis  and  tissue  immunostaining 
so  that  we  have  confirmed  P2X7  expression  on  lung  endohtelium  on  frozen  tissue  sections. 

In  addition  to  known  proteins,  we  also  detected  a  number  of  yet-to-be-characterized  proteins 
(25%).  This  group  includes  proteins  discovered  through  EST  and  cDNA  cloning  initiatives  as  well  as 
proteins  that  are  “similar  to  a  known  gene  from  the  human  or  mouse  database’"  predicted  by  automated 
annotation  of  the  rat  genome.  These  predicted  proteins  require  further  structural  and  functional 
characterization  before  they  can  be  classified  outside  this  category.  We  chose  to  generate  specific 
antibodies  to  two  proteins  (FLJ 10849  and  HSPAI2B)  in  this  category  for  validation  of  the  MS  findings. 
FLJI0849  is  a  hypothetical  protein  supported  by  expressed  sequence  tag  (EST)  evidence  that  was 
identified  with  good  sequence  coverage  (28.6%;  18  spectra;  Supplementary  Fig.  1  and  Supplementary 
Table  4  online).  A  BLAST12  search  of  the  sequence  revealed  its  closest  relative  to  be  human  septin  6 
with  81%  identity  and  a  pfam  search  showed  that  it  contained  a  CDCGTP  motif  that  is  common  to 
proteins  involved  in  cell  division  as  well  as  in  endocytosis  or  GTP-driven  vesicle  formation13. 
HSPA12B,  identified  in  rat  lung  P  (26  spectra,  28.4%  coverage  of  the  predicted  primary  sequence),  is  an 
apparent  member  of  the  Hsp70  super  family  (see  Supplementary  Fig.  2  and  Supplementary  Table  5 
online).  HSPA12B  is  supported  by  EST  findings  and  Northern  analysis  reveals  expression  in 
atherosclerotic  lesions14.  HSPAI2B  expression  was  also  confirmed  in  the  lung  endothelium  by  tissue 
immunostaining  (Fig.  6b)  but,  unfortunately,  the  FLJ  1 0849  antibody  was  suboptimal  and  gave 
significant  background  staining.  They  are  both  enriched  in  P  relative  to  H  so  that  it  is  very  likely  based 
on  our  experience  that  they  both  are  indeed  expressed  at  the  lung  EC  surface.  To  the  best  of  our 
knowledge  this  is  the  first  time  that  the  translation  of  these  2  proteins  has  been  confirmed  on  a  primary 
sequence  level. 

Protein  validation.  Given  that  a  protein  not  identified  by  MS  out  of  a  complex  mixture  is  not 
necessarily  absent  in  the  sample,  we  have  used  Western  analysis  to  test  for  many  proteins  not  detected  in 
RLMVEC  P.  87.5%  of  the  tested  proteins  that  are  not  detected  in  RLMVEC  by  MS  are  also  not  detected 
by  Western  analysis  and  91%  of  all  tested  proteins  that  are  detected  in  either  rat  lung  or  RLMVEC  P  are 
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confirmed  by  Western  analysis  (see  Supplementary  Table  3  online).  All  detected  protein  bands  agree 
with  the  predicted  molecular  weights  and,  given  the  relative  specificity  of  each  antibody  this  analysis 
provides  good  evidence  that  the  peptides  identified  by  MS  truly  corresponded  to  the  identified  proteins. 
Yet,  it  remains  problematic  that  the  public  protein  databases  are  as  yet  incomplete  so  that  we  cannot  rule 
out  the  possibility  for  the  existence  of  other,  yet-to-be-identified  proteins  that  share  peptide  sequences. 
Western  analysis  also  has  the  advantage  of  providing  quantitative  information  on  the  relative  levels  of 
each  protein.  Proteins  depleted  in  P  relative  to  total  cell/tissue  lysate  (H)  have  a  greater  probability  of 
being  contaminants  revealed  by  the  high  sensitivity  of  the  mass  spectrometer  whereas  proteins  enriched 
in  P  over  H  are  expected  rarely  to  be  contaminants. 

Immunohistochemical  staining  of  lung  tissue  can  provide  additional  validation  beyond 
subfractionation  to  show  clearly  lung  EC  expression  of  identified  proteins.  Unfortunately,  many 
antibodies  do  not  work  both  in  Western  analysis  and  immunohistology.  Proteins  are  denatured  to 
varying  degrees  after  SDS-PAGE  and  transfer  to  filters  whereas  proteins  in  frozen  tissue  may  be  more 
vative  in  structure  and  may  be  interacting  with  other  proteins  to  block  possible  antibody  reactivity. 
Tissue  immunostaining  can  be  eextremely  laborious  and  time  consuming  requiring  careful  preparation 
and  optimization.  In  our  experience  of  testing  >500  antibodies  in  the  last  decade,  >  50%  of  all  the 
antibodies  that  we  get  either  from  other  labs  or  from  commercial  sources  are  not  monospecific  by 
Western  analysis  and  thus  cannot  be  used  for  immunohistochemistry:  at  least  50%  of  these  monospecific 
antibodies  do  not  work  for  tissue  immunostaining. 
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Supplementary  Table  3:  Summary  of  Western  analysis 


Protein  name 

Rat  Lung 

H  P 

Cell  Culture 

H  P 

MS 

Lung  RLMVEC 

P  P 

MS  result 

confirmed 

Angiotensin-converting  enzyme 

+ 

0  0 

yes  no 

yes 

Aquaporin  1 

+ 

0  0 

yes  no 

yes 

Carbonic  anhydrase  IV 

+ 

0  0 

yes  no 

yes 

Endothelin  converting  enzyme 

0  + 

0  0 

yes  no 

yes 

High  affinity  nerve  growth  factor  receptor 

0  + 

- 

yes  no 

no 

Lyn  B  tyrosine  kinase 

0  + 

0 

yes  no 

no 

Nudeoiin 

+ 

0  0 

HHI  no 

yes 

P2Z  receptor 

+ 

0  0 

yes  no 

yes 

Podocalyxin • 

+ 

o  o 

yes  no 

yes 

Receptor  for  advanced  glycosylation  end  products 

+ 

0  0 

yes  no 

yes 

Seven  transmembrane  receptor 

+ 

0  0 

yes  no 

yes 

Sodium-hydrogen  exchanger  3  regulatory  factor  2 

0  + 

0  0 

yes  no 

yes 

Tight  junction  protein  2 -Z02 

0  + 

+  0 

MHHi  no 

yes 

Tyrosine-protein  kinase  receptor  TIE-2 

+  + 

0  0 

yes  no 

yes 

HSPA12B  (HSP70  motif  containing  protein) 

0  + 

0  0 

mmm  m h 

yes 

Vascular  endothelial  cadherin 

0  + 

0  0 

yes  no 

yes 

Aminopeptidase  N 

+  + 

0  + 

yes  yes 

yes 

Caveolin-1 

+ 

+  + 

yes  yes 

ve^^ 

Heat  shock  protein  47 

+  + 

+  + 

yes  yes 

yes 

Hypothetical  protein  FUJI 0849  (septin  homolog) 

+  + 

+ 

yes  yes 

yes 

+ 

+  + 

l/pc  i/po 

yes 

Ox-45 

+ 

0  0 

yes  yes 

no 

5-nucleotidase 

o  + 

+  + 

no  yes 

yes 

The  table  summarizes  the  results  from  Western  analysis  in  comparison  to  MS  analysis.  Proteins  are  listed  in  alphabetical 
order  in  three  categories.  The  three  categories  are  i)  not  found  by  MS  in  RLMVEC  P;  ii)  found  by  MS  in  RLMVEC  P  and  rat 
lung  P;  iii)  not  found  by  MS  in  rat  lung  P.  Open  circles  indicate  no  signal,  minus  signs  indicate  lower  signal  intensity  and 
plus  signs  indicate  higher  signal  intensity. 
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Supplementary  Figure  1 :  MS/MS  spectrum  of  peptide  of  FLJ10849 

nmQLLQSQflQQSGMMnc  ey 

Calculated  IIMM*  a  1957.01  i 

Calculated  lltaSU^  ■  973.01 


Tandem  mass  spectrum  of  the  doubly  charged  tryptic  peptide  MAAAQLLQSQAQQSGAOOTK"  corresponding  to  amino 
acid  positions  400  -  418  in  septin  homolog  FLJ 10849  (NCBI:  gi|34876531|).  The  overlapping  amino  acid  sequence 
"QLLQSQAQQSGAQQTKKD"  corresponding  to  positions  403  -  420  was  used  as  antigen  for  the  production  of 
polyclonal  antibodies  in  rat.  Mass  spectral  fragment  ions  cover  63.9%  of  the  peptide  sequence.  The  calculated  precursor 
m/z  (979.01 )  differs  by  0.89  mass  units  from  the  observed  m/z  of  the  precursor  ion  (979.90). 


Supplementary  Table  4:  Peptide  and  spectrum  coverage  of  FLJ10849 


Septin  6  homolog  FIJI 0849 


Charge  state 


position 

sequence 

i 

2 

3 

(15  - 

34) 

NLSLSGHVGFDSLPDQLVNK 

x/o 

(55  - 

65) 

STLMDTLFNTK 

x/o 

x/o 

(66  - 

80) 

FESDPATHNEPGVRL 

x/o 

(84  - 

93) 

SYELQESNVR 

X 

X 

(96  - 

110) 

LTIVDTVGFGDQINK 

o 

(111 

-  130) 

DDSYKPIVEYIDAQFEAYLQEELK 

o 

(138 

-  146) 

SLFNYHDTR 

x/o 

x/o 

(163 

-  170) 

SLDLVTMK 

x/o 

(176 

-  184) 

VNIIPIIAK 

x/o 

x/o 

(275 

-  286) 

EMLIRVNMEDLR 

X 

(293 

-  298) 

HYELYR 

x/o 

X 

(309 

-  326) 

DTDPDSKPFSLQETYEAK 

X 

(327 

-  336) 

RNEFLGELQK 

o 

x/o 

(399 

-  418) 

KAAAQLLQSQAQQSGAQQTK 

X 

(400 

-  418) 

AAAQ  L  LQ  S  QAQQ  SGAQQT  K 

x/o 

o 

Tryptic  peptides  of  Septin  homolog  FLJ10849  (NCBI:  gi|34876531|)  identified  by  MS/MS  in  rat  lung  P  are  listed  with 
their  primary  sequence  positions  and  the  precursor  charge  state.  The  sequence  coverage  per  amino  acid  is  28.6%.  (x) 
indicates  peptides  identified  in  rat  lung  P  and  (o)  indicates  peptides  identified  in  RLMVEC  P. 
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Supplementary  Figure  2: 


(A)  MS/MS  spectrum  peptide  of  HSPA12B 


Tandem  mass  spectrum  of  the  doubly  charged  tryptic  peptide  "EAAYLAGLYSR"  corresponding  to  amino  acid  positions 
246  . .]-28 1  in  HSPAI2B  Ratlus  norvegicus  (NCBt:  gi|34858716|  or  amino  acid  positions  221  -  231  of  HSPA12B  Mus 
musculus  (NCBI:  gi|2 131 2236| )  found  in  RLMVEC  P  sample.  Mass  spectral  fragment  ions  cover  90.0%  of  the  peptide 
sequence.  The  calculated  precursor  m/z  (575.32)  differs  by  0.04  mass  units  from  the  observed  m/z  of  the  precursor  ion 
(575.29). 
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Supplementary  Figure  2: 


(B)  Sequence  coverage  of  HSPA12B 

Comparison  of  HSPA12B  rat  sequence  with  mouse  sequence  (KIAA0417) 


1  HSPA12B 

2  KIAA0417 
3Consensus 


mltv; 

MLTV1 

MLTV! 


10  20  30  40  50 

U  I _ I _ I _ I _ L 

TVPEMGLQGLYISSSPERSPVPSPPGSPRTQESCGIAPLTPSQSPKP; 

PEMGLQGLYISSSPERSPVPSPPGSPRTQESCGIAPLTPSQSPK] 
TVPEMGLQGLYISSSPERSPVPSPPGSPRTQESCGIAPLTPSQSPKP; 


60 


70 

1 


'SWVAIDFGTTSSGYAFSFi 

FSVWAIDFGTTSSGYAFSF, 

'SVWAIDFGTTSSGYAFS 


80 

. 


FAS 

F*| 


80 

80 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


90  100  110  120  130  140  150  160 

PEAIHMFj|KWEGGDPGVAHQ^rPTCLLLTPEGIFHSFGYTA^DYYHDLDPEEAI  I  'Ml  il  I  iMllI  IIH  I 

PEAIHMMRKWEGGDPGVAHQKTPTCLLLTPEGIFHSFGYTARDYYHDLDPEEARDWLYFEKFKMKIHSATDLTLKTQL] 
PEAI HMMRKWEGGDPGVAHQKTPTCLLLTPEG I FHS FGYTARDYYHDLDPEEARDWLYFE KFKMKIHS ATDLT] 


160 

160 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA0417 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


1  HSPA12B 

2  KIAA04 17 
3Consensus 


170 


180 


190 


200 


210 


220 


230 


240 


FFKE] 


■■^lalevfahalrB 

VNGKKMLALEVFAHALRFF 

.vngkkmlalevfahalrffkeha: 


’KEHALQt 

’KEHALQj1- 

’kehalqL 


PERAENASDQLS  PYGPFPTP: 


ptppt: 


EQSECMLE! 

:cmle; 
ecmle: 


VLTVPAIW 


vrwvltvpaiwkq: 

VR WVLTVP A I W] 


250 


260 


270 


280 


290 


300 


310 


320 


gEAA' 

EAAYL 


-AGLVI 


kll I alepeaasvycrklr;  : ; . ; : .  y  ;  - :  i 

KLL I ALE P E AAS VYCRKLRLHQLMDLS  S  RTA 


'MREAAYLKRWEQ  PD  PRA  PS  YMTVYG  AGHTY  PQAGLVSREDAEKLL  I ALE PEAAS VYCRKLRLHQLMDLS : 


330 


340 


350 

I 


360 

I 


370 


380 


390 


400 


3ERRSIDSSFRHAREQLRRSRHSR  v  v  YMVADCGGGTVDLTVHQLEQPHGTLKELYKASGGP 

ERRS I DSS FRHAREQLRRSRHSRTFLVEAGVGELWAEMQEGDRYMVADCGGGTVDLTVHQLEQPHGTLKELYKASGG P 
3IDSSFRHAREQLRRSRHSRTFLVESGVGELWSEMQEGDRYMVADCGGGTVDLTVHQLEQPHGTLKELYKASGGP 


410  420  430  440  450  460  470  480 

1  I  I  I  1  I 

YGAVGVDLAFEQLLCRI FGEDF I AKFKRQRPAAWVDLT I AF EARKRTAGPMAG ALN I SLP FSF I DFYR^BGHNVETAL  4  80 
YGAVGVDLAFEQLLCRI FGEDF I AKFKRQRPAAWVDLTI AFEARKRTAGPHRAGALNI SLPFSF IDFYRKQRGHNVETA®  4  ^  0 
YGAVGVDLAFEQLLCRI FGEDF I AKFKRQRPAAWVDLTIAFEARKRTAGPHRAGALNISLPFSFIDFYRKQRGHNVETAL 


490 


500 

1 


510 


520 


530 


540 


550 


560 


3SCGMLRMSCEAMNELFQPTVSGIIQHIEMLLAKPEVQGVKLLFLVGGFAESAVLQHAVI 
3SCGMLRMSCEAMNELFQPTVSGI IQHI EMLLAKPEVQGVKLLFLVGGFAESAVLQHAVI 
2EAMNELFQPTVSGI IQHIEMLLAKPEVQGVKLLFLVGGj 


570  580  590  600  610  620  630  640 

VVPHDVGLTILKGAVLFGQAPGVVRBfeSPLTYGVGVLNRFVPGHHPPEKLLVRDGRRWCTDVFERFVAAEQSVALGEEvI  64  0 

WPHDVGLTILKGAVLFGQAPGWRVRRSPLTYGVGVLNRFVPGHHPPEKLLVRDGRRWCTDVFERFVAAEQSVALGECT  5  90 
WPHDVGLTILKGAVLFGQAPGWRVRRSPLTYGVGVLNRFVPGHHPPEKLLVRDGRRWCTDVFERFVAAEQSVALGEEV 


650 


660 


670 


680 


690 


PAR  PGQRR  VL I  NLYCCAAEDAFlFITDPGVR 


700 

1 


710 


720 

I 


5LELEPEGSPENTGTSPSRRBIRAAMQFGDTE I KVTAVDV 
S YCPARPGQRR VL INLYCCAAEDARF ITDPGVRKCGALSLELEPEGCPENTGTS  PSRREI RAAMQFGDTE I KVTAVDV 
RVLINLYCCAAEDARFITDPGVRKCGALSLELEPEGSPBNTGTSPSRREIRAAMQFGDTEIKVTAVDV 


730  740  750  760  770  780  790  800 


Amino  acid  sequences  of  HSPA12B  of  Rattus  norvegicus  (NCBI:  gi|34858716|)  and  Mus  musculus  (NCBI: 
gi|2 13 12236|)  are  aligned  and  the  tryptic  peptides  identified  ty  tandem  mass  spectra  are  highlighted  in  yellow.  The 
sequence  coverage  per  amino  acid  is  28.4%.  Peptide  EAAYLAGLVSR  (position  246  -[...]-281  in  rat  and  highlighted  in 
green)  is  specific  for  the  mouse  sequence  of  HSPA12B  (positions  221  -  231).  It’s  occurrence  in  rat  lung  P  strongly 
suggests  a  sequence  conflict.  See  also  supplemental  Table  5. 
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Supplementary  Table  5:  Peptide  and  spectrum  coverage  of  HSPA12B 


HSPA12B 


position 

sequence 

Charge  state 

1  2  3 

(90  - 

101) 

KWEGGD  PGVAHQ 

X 

(91  - 

101) 

WEGGDPGVAHQ 

X 

(124 

-  135) 

DYYHDLDPEEAR 

X 

(136 

-  142) 

DWLYFEK 

X 

(147 

-  157) 

IHSATDLTLK 

X 

(167 

-  178) 

MLALEVFAHALR 

X 

(228 

-  238) 

WVLTVPAIWK 

X 

(246 

-  [...]  -  281) 

EAAYL - 

-  -AGLVSR 

X 

(305 

-  314) 

LHQLMDLSSR 

X 

X 

(346 

-  366) 

TFLVESGVGELWSEMQEGDR 

(453 

-  469) 

AGALNISLPFSFIDFYR 

X 

(473 

-  481) 

GHNVETALR 

X 

X 

(559 

-  572) 

VWPHDVGLTILK 

X 

X 

(573 

-  586) 

GAVLFGQAPGWR 

X 

X 

(588 

-  600) 

RS  PLTYGVGVLNR 

X 

(589 

-  600) 

SPLTYGVGVLNR 

X 

X 

(601 

-  610) 

FVPGHHPPEK 

X 

(668 

-  676) 

FITDPGVRK 

X 

(704 

-  714) 

AAMQFGDTEIK 

X 

(715 

-  724) 

VTAVDVSTNR 

X 

X 

Tryptic  peptides  of  HSPAI2B  (NCBI:  gi|34858716|)  identified  by  MS/MS  in  rat  lung  P  are  listed  with  their  primary 
sequence  position  and  the  precursor  charge  state.  The  sequence  coverage  per  amino  acid  is  28.4%.  Peptide 
EAAYLAGLVSR  (position  246  -[..J-281)  is  specific  for  the  mouse  sequence  of  HSPA12B  (NCBI:  gi|21312236| 
position  221  -  231).  See  also  supplemental  Figure  2. 


Supplementary  Table  I 


Integral  and  lipid-anchored  plasma  membrane  proteins 


SwissProt  or 


NCBI 


Number  of 
MS/MS  spectra 


Protein  name 

locus 

Trembl  locus 

Lung  P 

RLMVEC 

PI 

MW 

Function 

TMs 

ADP/ATP  translocase  2 

728810 

Q09073 

3 

7 

9  74 

32901 

transport 

2 

ADP-nbosyl  cyclase  1  (CD38H) 

2493428 

Q64244 

4 

3 

883 

34436 

other 

1 

ALCAM  (CD166) 

2589007 

035112 

3 

5.77 

65022 

adhesion 

1 

Ammopeptidase  N 

113750 

PI 5684 

3 

4 

530 

109317 

ee 

1 

Angiotensin-converting  enzyme  (CD134) 

11493661 

PI 2821 

44 

595 

149715 

ee 

1 

Antigen  peptide  transporter  2 

407477 

P36372 

3 

2 

627 

77713 

transport 

5 

Aquaponn  1 

312924 

P29975 

3 

- 

7.70 

28830 

transport 

6 

Atnal  natnuretic  peptide  receptor  A 

204270 

PI 8910 

5 

- 

6  59 

118951 

signaling 

1 

Carbonic  anhydrase  IV 

544726 

P48284 

11 

- 

631 

35076 

ee 

GPI 

Caveolin-1 

17017233 

Q8VIK9 

13 

5 

530 

20553 

traffick 

1 

Caveolin-2 

17017237 

Q8VIK7 

4 

1 

565 

18175 

traffick 

2 

CD9  antigen 

729088 

P40241 

3 

2 

725 

25084 

adhesion 

4 

CDC42 

31542368 

Q8CFN2 

3 

2 

5  76 

21311 

signaling 

prenylate 

Cell  surface  glycoprotein  MUC18 

10566949 

X 

5 

- 

5  74 

71373 

other 

1 

Cerebral  protein- 11 

34880202 

075069 

5 

- 

5  87 

81692 

other 

2 

Chloride  intracellular  channel  5 

12232044 

Q9EPT8 

12 

- 

5.64 

28299 

transport 

channel 

Complement  component  Clq  receptor  (entothelial-CD93  antigen) 

9886763 

Q9ET61 

4 

- 

4  87 

68782 

other 

1 

C-Yes 

12539401 

Q99PW1 

4 

5 

622 

60588 

signaling 

mynstate 

Cytolysin 

203894 

P35763 

6 

- 

8  18 

61513 

other 

channel 

Ckpeptidyl  peptidase  IV  (CD26) 

118905 

PI  4740 

31 

- 

5  87 

88003 

ee 

1 

Ecto-apyrase  (CD39) 

1754710 

P97687 

4 

- 

748 

57408 

ee 

2 

Ecto-ATPase 

203990 

PI 6573 

3 

- 

562 

57264 

ee 

1 

Endothelial  cell-selective  adhesion  molecule 

27720013 

X 

8 

- 

853 

31382 

adhesion 

1 

Endothehn  converting  enzyme 

529085 

P42893 

7 

- 

6  37 

86126 

ee 

1 

Fk>tillin-2 

4097589 

Q9Z2S9 

3 

4 

5.13 

47038 

traffick 

FYN 

1101768 

Q62844 

3 

- 

623 

60702 

signaling 

mynstate 

G  protein  beta  1  subunit 

31669 

P04901 

10 

- 

560 

37377 

signaling 

- 

G  protein  beta  2  subunit 

1730215 

P54313 

6 

- 

561 

37500 

signaling 

G  protein  G  alpha  1 1 

8925964 

Q9JID2 

10 

2 

591 

42026 

signaling 

palmitate 

G  protein  G  alpha  13 

27690213 

X 

4 

3 

838 

43811 

signaling 

palmitate 

G  protein  Gi  alpha  2 

71896 

P04897 

7 

3 

528 

40368 

signaling 

palmitate 

G  protein  Gi  alpha  3 

6980964 

P08753 

6 

1 

5.51 

40391 

signaling 

palmitate 

G  protein  Go  alpha  2 

27808491 

P30033 

6 

3 

562 

40073 

signaling 

palmitate 

G  protein  Gq  alpha 

7329187 

P82471 

8 

3 

558 

41425 

signaling 

palmitate 

G  protein  Gs  alpha 

14161099 

Q63803 

6 

2 

550 

45709 

signaling 

palmitate 

Giantin 

516826 

Q63714 

4 

• 

5  01 

364297 

other 

1 

Glut4 

4193489 

Q9Z1X1 

4 

6 

547 

121159 

transport 

1 

High  affinity  nerve  growth  factor  receptor  (TRK-A) 

549122 

P35739 

4 

- 

598 

87868 

signaling 

1 

Integrin  alpha- 1  (CD49a) 

124941 

P18614 

8 

18 

565 

130809 

adhesion 

1 

Integnn  alpha-5  (CD49c) 

34868649 

Q80YP5 

4 

6 

5.70 

115016 

adhesion 

1 

Integnn  alpha-6  (CD49e) 

13992591 

Q924W2 

4 

1 

6  53 

108807 

adhesion 

1 

Integnn  beta-1  (CD29) 

520566 

P49134 

11 

10 

5.77 

88495 

adhesion 

1 

Intercellular  adhesion  molecule- 1  (CD54) 

124100 

Q 002 38 

6 

10 

653 

60142 

adhesion 

1 

Intercellular  adhesion  molecule-2 

34873952 

- 

3 

- 

802 

31091 

adhesion 

1 

K-Ras  2B 

1172844 

P46203 

3 

1 

824 

21425 

signaling 

0 

Upoprotem  lipase 

462538 

Q06000 

3 

- 

836 

53082 

ee 

0 

Low-density  Upoprotem  receptor-related  protein  2 

13562118 

P98158 

3 

1 

503 

519276 

other 

1 

Lutheran  antigen 

10566957 

Q9ESS6 

7 

- 

5  52 

67512 

other 

1 

Lyn  tyrosine  kinase 

2105002 

Q07014 

13 

- 

6.76 

58529 

signaling 

prenylation 

Membrane-bound  aminopeptidase  P 

13560983 

Q99MA2 

25 

- 

549 

76080 

ee 

GPI 

MHC  class  1  protein,  isoform  a 

3493247 

X 

3 

- 

635 

39348 

other 

1 

MHC  class  1  protein,  isoform  b  (RT1  A1(f)  protein) 

1483574 

Q31257 

15 

1 

564 

39246 

other 

1 

MHC  class  l  protein,  isoform  c 

7453015 

X 

5 

- 

7.17 

41718 

other 

1 

MHC  class  1  protein,  isoform  d 

1263198 

X 

6 

5  55 

41473 

other 

1 

Microsomal  dipeptidase 

459933 

P31430 

7 

2 

567 

45506 

ee 

GPI 

Myoferlin 

23597920 

Q9NZM1 

9 

21 

6  19 

126162 

other 

1 

Na+K+  transporting  ATPase  alpha- 1 

114376 

P06685 

20 

18 

530 

113054 

transport 

7 

Na+K+  transporting  ATPase  beta-3 

3121778 

Q63377 

4 

2 

808 

31830 

transport 

1 

Neural  vismin-like  protein  3 

8393864 

P35333 

4 

- 

5  32 

22207 

transport 

mynstate 

Nogo-A  protein 

6822247 

Q9JK11 

4 

5 

441 

126388 

signaling 

2 
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OX-2  membrane  glycoprotein 

129300 

P04218 

4 

- 

9  02 

31088 

other 

1 

OX-45  antigen  (CD48) 

56805 

PI 0252 

9 

- 

767 

27680 

adhesion 

GPI 

P2Z  receptor  (P2X7) 

2499425 

064663 

3 

- 

830 

68392 

signaling 

2 

Plasma  membrane  calcium-transporting  ATPase  1 

92025 

PI 1505 

5 

1 

5.71 

138719 

transport 

7 

Plasma  membrane  calcium-transporting  ATPase  4 

1054879 

064542 

5 

6  17 

133094 

transport 

7 

Platelet-endothelial  cell  adhesion  molecule- 1  (CD31) 

1684839 

P97635 

4 

863 

38164 

adhesion 

1 

Podocalyxin 

4996222 

Q9WTQ2 

5 

481 

51545 

adhesion 

1 

Progesterone  receptor  component  1 

6647578 

P70580 

4 

1 

4  45 

21467 

signaling 

1 

PV-1 

5281517 

Q9WV78 

21 

880 

50033 

other 

1 

Rab-IIB 

9837357 

035509 

6 

3 

5  98 

24509 

signaling 

prenylation 

Rab-5b 

27689505 

X 

5 

2 

864 

23426 

signaling 

prenylation 

Ral-A 

131836 

P05810 

4 

1 

666 

23553 

signaling 

prenylation 

Ral-B 

310212 

P36860 

6 

1 

624 

23317 

signaling 

prenylation 

Rap-la 

27660326 

P10113 

4 

1 

6  39 

20987 

signaling 

prenylation 

Rap  1b 

595280 

062636 

4 

2 

537 

20929 

signaling 

prenylation 

Ras-like  protein  TC25 

27662306 

X 

3 

1 

9  10 

16797 

signaling 

prenylation 

Receptor  for  advanced  glycosylation  end  products 

498034 

063495 

12 

- 

583 

42664 

signaling 

1 

Seven  transmembrane  receptor 

5525078 

Q9WVT0 

4 

- 

6  46 

149446 

signaling 

7 

Tapasm 

13399280 

Q99JC6 

4 

1 

7  74 

50045 

other 

1 

Thrombomodulin 

2502062 

P07204 

6 

- 

478 

60329 

signaling 

1 

TIE-2 

34869831 

Q9QW24 

5 

- 

594 

95115 

signaling 

1 

Transmembrane  4  superfamily  member  3 

6179618 

055158 

5 

564 

25545 

signaling 

4 

VAMP-associated  protein  A 

4240462 

Q9Z270 

3 

4 

858 

27223 

traffic* 

1 

Vascular  endothelial-cadhenn 

34851204 

P55284 

7 

- 

5  12 

83275 

adhesion 

1 

Cytoskeletal  and/or  junctional  proteins 

Protein  name 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

Number  of 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW 

Function 

predicted 

TMs 

Actin  -  alpha 

511131 

P04270 

29 

30 

523 

42019 

structural 

Actin  -  beta 

13592133 

P02570 

30 

21 

529 

41737 

structural 

Actin  -  gamma 

27714843 

X 

9 

14 

5.21 

35162 

structural 

Actinin  1-  alpha 

4210985 

Q9Z1P2 

15 

35 

5  23 

102960 

structural 

Actinn  alpha-4 

6636119 

Q9QXQ0 

9 

19 

524 

104786 

structural 

Adduan,  alpha 

1200129 

Q63028 

9 

1 

581 

80355 

structural 

Adducin.  gamma 

1041240 

062847 

4 

3 

546 

78804 

structural 

Afadin  (0 

2555011 

035889 

8 

1 

582 

207678 

structural 

Ankynn  G 

3885972 

X 

3 

1 

794 

284459 

other 

Catemn  alpha 

34878659 

X 

34 

1 

592 

105787 

structural 

Catemn  beta 

16758082 

Q9WU82 

16 

1 

5  53 

85455 

structural 

Catemn  delta- 1 

34856482 

X 

18 

1 

596 

101396 

structural 

Desmm 

1352241 

P48675 

5 

6 

521 

53326 

structural 

Desmoplakm  1 

34875216 

X 

3 

- 

5  92 

105787 

structural 

Desmusiin 

27676906 

Q810D0 

3 

- 

5.06 

140733 

structural 

Dynein,  heavy  chain 

31377489 

P38650 

5 

30 

604 

532025 

structural 

Epiplakin  1 

34867002 

X 

6 

3 

565 

368204 

structural 

ERM-bmding  phosphoprotein 

8132349 

Q9JJ19 

12 

1 

5  70 

38830 

structural 

Ezrin 

17902245 

X 

60 

21 

5  59 

138687 

structural 

F -actin  capping  protein  alpha- 1 

34859736 

X 

5 

3 

5.43 

32910 

structural 

F -actin  capping  protein  alpha-2 

34883261 

X 

5 

3 

5  57 

32967 

structural 

F-actin  capping  protein  beta 

34872216 

X 

7 

3 

569 

30629 

structural 

Filamin  1 

34881882 

X 

61 

81 

567 

279442 

structural 

Fodnn  alpha  chain 

3462887 

PI 6086 

227 

109 

520 

284638 

structural 

Gelsolin 

34853856 

X 

9 

4 

622 

84946 

structural 

Junction  plakoglobin 

1497985 

PI 4923 

16 

5  95 

81498 

structural 

Kinesin  heavy  chain  58 

27687147 

X 

5 

5 

6  25 

97300 

other 

Kinesin  heavy  chain  5C 

34854278 

X 

6 

3 

599 

114413 

other 

Microtubule-associated  protein  IB 

1083716 

X 

5 

2 

4.73 

259025 

structural 

Microtubuleassociated  protein  4 

34866431 

X 

8 

7 

5  29 

303149 

structural 

Moesin 

2218139 

X 

94 

41 

5.75 

206407 

structural 

Myosin  heavy  chain,  smooth  muscle 

92503 

X 

7 

5 

5.19 

47803 

structural 

Myosin  heavy  chain-A,  nonmuscle 

967249 

X 

142 

139 

549 

226338 

structural 

Myosin  heavy  cham-B.  nonmusde 

7381235 

X 

51 

112 

549 

228965 

structural 

Myosin  I  beta 

13431669 

Q 05096 

10 

4 

942 

131918 

structural 

Myosin  1  gamma 

23821864 

Q63357 

4 

3 

945 

116137 

structural 
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Myosin  light  chain  1 

56670 

X 

6 

- 

503 

22156 

structural 

Myosin  light  chain  1,  atnal 

57513 

X 

8 

1 

496 

21282 

structural 

Myosin  regulatory  light  chain  2-A.  smooth  muscle  isoform 

57087 

X 

15 

15 

4  67 

19895 

structural 

Myosin  VI 

34865321 

X 

5 

- 

8  59 

116960 

structural 

Myosin,  unconventional  Myr2 1  heavy  chain 

12831209 

X 

15 

29 

9.41 

118090 

structural 

Myotonic  dystrophy  kinase-related  Cdc42-bmding  kinase  beta 

7446380 

054875 

11 

1 

638 

194020 

signaling 

N-WASP 

2274845 

008816 

4 

1 

8  33 

54325 

structural 

Periplakin 

34868748 

X 

7 

1 

5.35 

204147 

structural 

Plectin 

1709655 

P30427 

160 

24 

5.71 

533540 

structural 

Radixin 

34863387 

X 

60 

26 

569 

66125 

structural 

Spectnn.  beta 

34879632 

X 

137 

52 

548 

278423 

structural 

Spectnn-like  protein  GTRAP41 

11066461 

Q9QWN8 

11 

1 

5  59 

271064 

structural 

Tau  -big 

207158 

X 

5 

- 

554 

71774 

other 

Tensm 

27684945 

X 

16 

2 

629 

64171 

structural 

Tight  junction  protein  1  -ZOI 

34857171 

X 

26 

- 

596 

215532 

structural 

Tight  junction  protein  2  -  Z02 

1839162 

Q9UDY2 

15 

- 

696 

133972 

structural 

Transgelin  2 

27678408 

X 

3 

2 

841 

22393 

other 

Tnpartite  motrf  protein  TRIM2 

34857890 

X 

3 

- 

6  78 

90452 

other 

Tnpartite  motif  protein  TRIM3 

21362968 

070277 

5 

- 

8  17 

80796 

other 

Tropomodulin  1 

1628561 

P70567 

4 

- 

497 

40480 

structural 

Tropomodutin  2 

13928838 

P70566 

3 

- 

534 

39492 

structural 

Tropomodulin  3 

34864364 

X 

6 

4 

491 

46416 

structural 

Tropomyosin  2 

112442 

P04692 

11 

6 

4.71 

32695 

structural 

Tropomyosin  3 

17105362 

063600 

13 

4 

4  68 

32819 

structural 

Tropomyosin  4 

136081 

P09495 

7 

4 

466 

28510 

structural 

Tubulin,  alpha- 1 

55777 

X 

14 

11 

494 

50136 

structural 

Tubulin,  Beta  3 

21245098 

X 

6 

3 

4  88 

50303 

structural 

Tubulin,  beta-2 

37494 

P05217 

22 

7 

479 

49831 

structural 

Tubulin,  beta-5 

27465535 

P05218 

25 

11 

478 

49640 

structural 

Utrophm 

2960013 

X 

16 

2 

5.18 

391075 

structural 

Vimentin 

57480 

P31000 

26 

42 

506 

53602 

structural 

WASP  interacting  protein 

12275264 

X 

5 

11  42 

49751 

structural 

Peripherally  associated  on  inside 

Protein  name 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

Number  of 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW 

Function 

Abl-interactor  1 

13242310 

X 

3 

1 

6.57 

51705 

signaling 

Adaptm,  alpha  A 

113337 

X 

5 

1 

651 

104045 

traffick 

Adaptin,  alpha-C 

90292 

P18484 

14 

1 

6  53 

103913 

traffick 

Adaptm.  beta 

18034787 

P21851 

11 

1 

5.19 

105691 

traffick 

Adaptin.  beta-3A 

34857415 

X 

3 

1 

547 

119142 

traffick 

Annexin  All 

1351943 

P48037 

6 

- 

539 

75623 

signaling 

Annexin  A2 

294518 

Q07936 

39 

33 

753 

38547 

signaling 

Annexin  A3 

113955 

P14669 

4 

- 

604 

36322 

signaling 

Annexin  A4 

1703320 

P55260 

14 

- 

5  32 

35744 

signaling 

Annexin  A6 

763181 

P48037 

24 

8 

5.39 

75623 

signaling 

Argmosucctnase 

31377525 

X 

3 

- 

588 

51391 

ee 

Arrestin  beta-1 

203102 

P29066 

9 

- 

584 

47066 

traffick 

Arrestm  C 

543855 

P36576 

3 

- 

548 

9878 

traffick 

Beta-adrenergic  receptor  kinase  1 

114153 

P26817 

7 

- 

654 

79785 

signaling 

Calgranulin  A 

13638435 

P50115 

4 

- 

5  69 

10107 

other 

Calgranulm  B 

478287 

P50116 

10 

- 

7.24 

13014 

other 

Calmodulin 

230824 

P02593 

4 

3 

409 

16706 

other 

Cdc42  guanine  nucleotide  exchange  factor  zizimin  1 

34876077 

X 

4 

- 

744 

260094 

signaling 

Centaunn-alpha2 

7636039 

Q9JK15 

4 

- 

935 

43524 

other 

Clathnn  coat  assembly  protein  AP50 

113332 

P20172 

7 

1 

9  57 

49655 

traffick 

Cytosolic  sorting  protein  PACS-la 

3347953 

088588 

6 

- 

7.61 

104700 

traffick 

Dynamin  II 

729380 

P39052 

6 

4 

630 

44036 

traffick 

E3KARP.  NHE3  kinase  A  regulatory  protein 

13925523 

X 

24 

536 

84531 

signaling 

EBP50-PDZ  interactor 

34879000 

X 

4 

1 

7.61 

56788 

other 

Exo70 

2827160 

054922 

3 

- 

627 

75046 

traffick 

Exocyst  complex  component  Sec3 

27694722 

X 

3 

1 

6  93 

64552 

traffick 

Exocyst  complex  component  Sec5 

19705549 

054921 

4 

3 

667 

104031 

traffick 

G-protem-coupled  receptor  kinase  5 

2499663 

Q62833 

4 

- 

848 

67783 

signaling 

predicted 

TMs 
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Granzyme  K 

1708036 

P49864 

4 

- 

9  59 

28465 

signaling 

Growth-arrest-speciftc  protein  7 

25742641 

X 

4 

2 

724 

50394 

other 

GTPase  activating  protein  1 

34857388 

X 

26 

3 

6  13 

193859 

signaling 

Heat  shock  cognate  71  kDa  protein  (HSC73) 

56379 

P08109 

13 

2 

537 

70871 

other 

Hic-5/ARA55  protein 

12659068 

Q99PD6 

3 

- 

8  36 

36181 

other 

Integnn-linked  kinase 

13111653 

055222 

24 

6 

830 

51373 

signaling 

Janus  protein  tyrosine  kinase  1 

2288925 

035803 

3 

- 

826 

99649 

signaling 

Latent  TGF-beta  binding  protein-2  like  protein 

2463409 

035806 

3 

12 

520 

189867 

structural 

Lim  protein  1  (FHL-1) 

4894849 

Q9WUH4 

3 

1 

8  76 

31904 

other 

Methionine  aminopeptidase  2 

204004 

P38062 

7 

- 

572 

53052 

ee 

Mu  nc  18-3 

12583689 

Q99PV2 

3 

828 

68021 

transport 

Neurofibromm  2 

32363191 

06 3648 

3 

1 

620 

68712 

structural 

Numb  protein  isoform 

34867377 

X 

7 

- 

9  32 

69246 

other 

Parvm-alpha 

13526899 

Q9HB97 

5 

2 

569 

42292 

structural 

Parvm-beta 

34867494 

X 

3 

- 

5.90 

41701 

structural 

Peptide  N-mynstoyttransferase  1 

22507314 

X 

7 

1 

9  14 

56937 

other 

Pen  ax  in 

18642399 

063425 

28 

1 

860 

146400 

other 

Peroxiredoxm  4 

4336879 

Q920V5 

3 

1 

586 

30540 

other 

Phosphatidyimositol  4-kmase,  230kDa 

1944499 

008662 

3 

- 

6.57 

231320 

signaling 

Phosphatidylinositoi-4-phosphate  5-kmase  type  II  alpha 

16758974 

Q9R0I8 

5 

- 

650 

46225 

signaling 

Phosphatidyhnositol-4-phosphate  5-kinase  type  II  beta 

16758316 

088377 

5 

- 

7.18 

47264 

signaling 

Phosphatidylinositol-4-phosphate  5-kinase  type  II  gamma 

17978457 

088370 

4 

1 

6  39 

47336 

signaling 

Phosphatidyhnositol-binding  clathnn  assembly  protein 

16758324 

055012 

4 

2 

858 

69286 

traffick 

Phospholipase  C-beta-1 

91896 

P10687 

10 

1 

586 

138344 

signaling 

Phospholipase  C-beta-3 

13177635 

P51432 

21 

1 

5.77 

139492 

signaling 

Phospholipase  C-beta-4 

435757 

Q9QW07 

13 

1 

641 

134497 

signaling 

Phospholipase  C-beta-4B 

4063009 

X 

10 

1 

6.11 

116040 

signaling 

Phospholipase  C-gamma-2 

130230 

P24135 

4 

- 

6.40 

147735 

signaling 

Pincher 

20135683 

Q8R3Z7 

19 

2 

633 

61468 

traffick 

Prohibitin 

13937353 

X 

3 

11 

5  57 

29820 

other 

Prohne  nch  synapse  associated  protein  2 

5262748 

Q9JLU4 

13 

2 

898 

193258 

traffick 

Protein  kinase  A  anchoring  protein  2 

34868424 

X 

9 

- 

488 

90962 

signaling 

Protein  kinase  C  delta  bindig  protein 

4092842 

X 

8 

5 

579 

27910 

signaling 

Protein  tyrosine  phosphatase,  non-receptor  type  6 

16758788 

P81718 

8 

- 

763 

69578 

signaling 

Protein-tyrosine  phosphatase,  non-receptor  type  6 

16758788 

X 

9 

- 

763 

69578 

signaling 

Ral  simian  leukemia  viral  oncogene  homolog  A 

29741807 

X 

3 

- 

934 

24325 

signaling 

Ras  suppressor  protein  1 

34877253 

X 

18 

5 

5.34 

347557 

signaling 

Ras-bindmg  protein  SUR-8 

27685625 

X 

4 

- 

919 

32823 

signaling 

Rho-  and  Arf -GTPase  activating  protein  1 

27682975 

X 

6 

- 

568 

235892 

signaling 

Rho  GTPase  activating  protein  5 

34865207 

X 

4 

1 

6.15 

172287 

signaling 

RhoGEF  glutamate  transport  modulator  GTRAP48 

13027442 

X 

3 

- 

550 

168534 

signaling 

Rho-mteracting  protein  3 

10803059 

Q9ERE6 

7 

13 

5.95 

117113 

signaling 

Scaffolding  protein  SLIPR 

7650497 

X 

5 

- 

580 

129405 

other 

Septin  2 

13928415 

X 

14 

16 

6  15 

41593 

structural 

Septm  4 

34872968 

X 

7 

- 

581 

53083 

structural 

Septin  7  (CDC10) 

9789715 

Q9WVC0 

20 

15 

8  82 

50508 

structural 

Septm-hke  protein 

28849875 

Q9QZR6 

3 

2 

865 

63792 

structural 

Senne  proteinase  inhibitor  3 

111807 

P09006 

3 

- 

532 

46652 

other 

SerineAhreomne  kinase  10 

9507151 

X 

5 

- 

9  44 

57390 

signaling 

SerineAhreonine  kinase  MARK2 

2052191 

008679 

3 

- 

963 

80872 

signaling 

SenneAhreomne  protein  kinase  TAOI 

7514084 

088664 

3 

- 

730 

115952 

signaling 

SenneAhreomne  protein  phosphatase  PP1  Ml  10  subunit 

802105 

X 

4 

- 

5.30 

109720 

signaling 

SenneAhreomne  protein  phosphatase  PP1B 

585712 

P37140 

4 

4 

584 

37187 

signaling 

Serum  depnvation  response 

34875838 

X 

12 

2 

5.21 

46386 

signaling 

Smadl 

3192871 

P97588 

3 

- 

690 

52713 

signaling 

SNAP-23 

2970673 

070377 

5 

3 

482 

23235 

traffick 

Sodium-hydrogen  exchanger  3  regulatory  factor  1 

11024674 

X 

7 

- 

5.70 

38830 

signaling 

Sodium-hydrogen  exchanger  3  regulatory  factor  2 

15419607 

X 

23 

- 

7.25 

37369 

signaling 

Syndapin  2 

6651167 

Q9QY17 

12 

- 

5.04 

55978 

traffick 

Thioredoxm-dependent  peroxide  reductase  2 

2499470 

Q63716 

7 

4 

8.27 

22109 

other 

Thrombospondin  1 

33340123 

AAQ 14549 

4 

9 

4  74 

129671 

adhesion 

TOAD-64 

1351260 

P47942 

3 

9 

595 

62278 

other 

TPR-contaming  Rab8b-mteractmg  protein 

27465633 

Q925N3 

3 

- 

5.12 

64885 

other 

Ubiquitin 

20302085 

X 

4 

5 

694 

34361 

other 
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Outside  bound  -  secreted  /  blood 


Protein  name 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW 

Function 

predicted 

TMs 

Apohpoprotem  A-l 

91984 

P04639 

3 

5  52 

30088 

other 

Beta-2-microglobulin 

999862 

P07151 

4 

- 

780 

13720 

other 

Fetuin 

112459 

P24090 

5 

- 

6  05 

37982 

other 

• 

Fibrinogen  B.  beta  chain 

455105 

PI 4480 

6 

- 

789 

54303 

other 

Fibronectin 

120178 

P04937 

3 

117 

550 

272511 

other 

Glutathione  peroxidase 

121668 

P04041 

7 

2 

765 

22258 

other 

Hemoglobin,  alpha  chain 

1304381 

P01946 

12 

- 

793 

15197 

other 

Hemoglobin,  beta  chain 

204570 

P02091 

5 

- 

799 

15848 

other 

Hepatoma-denved  growth  factor  related  protein  3 

21955178 

Q9D2M7 

6 

- 

84 

22447 

other 

Histidine-rich  glycoprotein  1 

13358876 

X 

13 

- 

7  76 

59049 

other 

Histidine-nch  glycoprotein  2 

13358878 

X 

14 

- 

7  55 

58056 

other 

Inter-alpha-inhibitor  H4P  heavy  chain 

7441758 

035802 

4 

608 

103607 

other 

1 

Laminin  alpha-5 

34860912 

X 

3 

6 

6.21 

415468 

other 

Laminin  beta-1 

27717157 

X 

3 

2 

4  82 

21738 

other 

Laminin  gamma- 1 

27712576 

PI 1047 

3 

3 

520 

167423 

other 

Mast  cell  protease  8 

8928167 

P97594 

6 

- 

902 

27486 

other 

Mast  cell  protease  9  (Granzyme  J) 

1763232 

P97595 

6 

- 

933 

26314 

other 

Prolargin 

10834558 

Q9EQP5 

15 

1 

9.51 

43179 

other 

Serum  albumin 

55628 

P02770 

16 

1 

609 

68719 

other 

Spondm 

544353 

P35446 

7 

- 

585 

90773 

other 

Transferrin 

1854476 

P12346 

7 

1 

694 

76364 

other 

Vitronectin 

1255044 

Q62905 

6 

- 

568 

54679 

other 

Von  Willebrand  factor 

4467984 

P04275 

12 

- 

532 

309299 

other 

Poorly  characterized  proteins  (unknown 

localization, 

structure,  and  function) 

Number  of 

Protein  name 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW 

Function 

predicted 

TMs 

AC2-233 

32527749 

Q7TPJ1 

3 

- 

6.16 

161633 

Bram-ennched  WD-repeat  protein 

13660779 

X 

6 

- 

6  74 

57551 

cDNA  Sequence  BC019977 

34871032 

X 

3 

- 

999 

90565 

DAMP-1  protein 

28070980 

Q811A2 

4 

- 

8  38 

19674 

EH-domam  containing  protein  2 

34452241 

Q8R491 

5 

2 

595 

61106 

EH-domain  containing  protein  3 

34536836 

X 

6 

2 

604 

60807 

Granzyme-like  protein  II 

296176 

Q06606 

7 

- 

8  74 

27463 

Hypothetical  protein  FLJ 10849 

34876531 

X 

18 

20 

647 

58774 

Hypothetical  protein  FU13187 

34866158 

X 

3 

- 

855 

27741 

Hypothebcal  protein  FLJ20401 

34856089 

X 

6 

- 

762 

98851 

Hypothetical  protein  FLJ30973 

34864323 

X 

4 

- 

5  55 

61017 

Hypothetical  protein  KIAA0332 

27721099 

X 

3 

- 

7.32 

110303 

Hypothetical  protein  KIAA0635 

27694697 

X 

3 

2 

5  66 

55620 

Hypothetical  protein  KIAA0763 

27712638 

X 

3 

1 

886 

139934 

Hypothetical  protein  KIAA0915 

27672832 

X 

8 

- 

866 

91406 

Hypothetical  protein  KIAA0982 

27688091 

X 

4 

- 

5.69 

57127 

Hypothetical  protein  KIAA1902 

27699143 

X 

3 

6  16 

89929 

Hypothetical  protein  XP_238419 

27716235 

X 

4 

1 

10.00 

9379 

Hypothetical  RNA  binding  protein  RDA288 

18041977 

X 

6 

16 

842 

43012 

Similar  to  KIAA 1 185  protein 

34872773 

X 

6 

1 

852 

63531 

Lethal  giant  larvae-like  protein  1  homolog 

22779849 

X 

3 

- 

6  14 

112495 

Neuronal  tissue-ennched  acidic  protein 

11560135 

X 

3 

5 

4.50 

21790 

Protein  kinase  SK2 

7514055 

X 

12 

1 

496 

137888 

RIKEN  cDNA  2010012F05 

34859075 

X 

3 

2 

480 

23760 

Similar  to  a  C  elegans  protein  RW1 

34875601 

X 

3 

1 

7  24 

190311 

Similar  to  actin  binding  protein-278 

34869526 

X 

23 

30 

541 

284704 

Similar  to  Bram-Speafic  Protein  p25  Alpha 

27729783 

X 

3 

- 

9.71 

54772 

Similar  to  BUB3 

34859446 

X 

12 

- 

6  32 

30535 

Similar  to  cask-interacting  Protein  2 

34875360 

X 

4 

- 

6  58 

127726 

Similar  to  CGI-74  protein 

34855164 

X 

3 

- 

11  56 

26564 

Similar  to  chromodomam-helicase-DNA-binding  protein  5 

34860336 

X 

3 

5 

5.74 

301239 
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Similar  to  cleavage  and  polyadenylation  specific  factor  6 

34864893 

X 

4 

1 

914 

69006 

Similar  to  cobl-related  protein  1 

34854739 

X 

3 

6  39 

135106 

Similar  to  cytochrome  b5  Reductase  1 

27712062 

X 

3 

1 

895 

34233 

Similar  to  DEAD-box  protein  05 

27690065 

X 

14 

6 

9  06 

69239 

Similar  to  DEAD-box  protein  1 7 

34866944 

X 

8 

5 

8  75 

73945 

Similar  to  DEAH  box  protein  15 

34878065 

X 

4 

1 

7.12 

90977 

Similar  to  dJ1033H22  1  (KIAA0554) 

34860217 

X 

9 

- 

6  40 

68740 

Similar  to  DNA-PKcs 

34870011 

X 

3 

1 

6  90 

470420 

Similar  to  DOCK  1 80  protein 

34874363 

X 

4 

- 

7  15 

218571 

Similar  to  DRIM  (downregulated  in  metastasis)  protein 

27717947 

X 

3 

1 

8  59 

289353 

Similar  to  ELM01 

34876187 

X 

6 

- 

5  86 

44456 

Similar  to  ELM02  (KIAA 1834) 

34860711 

X 

3 

- 

5.91 

86108 

Similar  to  eukaryotic  translation  initiation  factor  5A 

27672956 

Q8VHU8 

5 

6 

5.07 

16832 

Similar  to  fibrillann 

34855407 

X 

3 

10 

10.25 

34222 

Similar  to  FK506  binding  protein  3 

27664664 

X 

5 

2 

9  29 

25148 

Similar  to  FSHD  region  gene  1  protein 

34872381 

X 

5 

- 

8  95 

29031 

Similar  to  Hbsll  protein 

34852821 

X 

3 

- 

9  17 

47772 

Similar  to  heat  shock  70  kDa  protein  12A  (HSPA12A) 

34864793 

X 

3 

1 

8  87 

129368 

Similar  to  heat  shock  70  kDa  protein  12B  (HSPA 12B) 

34858716 

X 

25 

- 

858 

81822 

Similar  to  HSPC31 5 

34871854 

X 

3 

- 

6  37 

28146 

Similar  to  hypothetical  protein 

27710240 

X 

6 

- 

8  47 

77072 

Similar  to  hypothetical  protein  MGC29390 

34869210 

X 

3 

1 

6  53 

202867 

Similar  to  Integhn  alpha-L 

34859294 

X 

5 

- 

6  15 

151135 

Similar  to  KIAA  1083  protein 

34862486 

X 

3 

- 

961 

69002 

Similar  to  KIAA  1230  protein 

27713641 

X 

3 

- 

5  43 

111708 

Similar  to  KIAA  1849  protein 

27731437 

X 

6 

- 

11.27 

18939 

Similar  to  LD39815p 

27716313 

X 

3 

1 

6  23 

29077 

Similar  to  leiomodin  1 

34880269 

X 

3 

- 

9.27 

62197 

Similar  to  Map4k6-pendmg  protein 

34871056 

X 

4 

- 

8  00 

56056 

Similar  to  mBLVR 

34862288 

X 

4 

- 

8.11 

81993 

Similar  to  mitochondhal  import  receptor  subunit  TOM 34 

34860666 

X 

3 

- 

923 

34461 

Similar  to  mKIAA0219 

34872627 

Q92616 

4 

7 

7  36 

303089 

Similar  to  myeloperoxidase 

34873270 

X 

3 

- 

961 

80883 

Similar  to  Na+  and  Hi- -coupled  glutamine  transporter 

16923996 

Q8VEK3 

9 

16 

5  92 

87748 

Similar  to  nebulin 

34854661 

X 

3 

1 

9  08 

909311 

Similar  to  nicotinamide  nucleotide  transhydrogenase 

34854180 

Q 13423 

3 

9 

9  17 

34922 

Similar  to  nuclear  receptor  coactivator  NCoA-62 

34867492 

X 

3 

2 

9  00 

145690 

Similar  to  OSBP-related  protein  6 

3485644 8 

X 

3 

- 

6  63 

105187 

Similar  to  osmotic  stress  protein 

34856875 

X 

3 

3 

9  91 

35022 

Similar  to  p53  inducible  protein 

34870680 

X 

7 

- 

6  10 

127654 

Similar  to  polymerase  l-transcnpt  release  factor 

27689507 

X 

25 

5 

5  43 

43909 

Similar  to  pre-mRNA  cleavage  factor  Im  (25kD) 

12655103 

X 

13 

3 

8.85 

26227 

Similar  to  protein  disulfide  isomerase  A5 

34869214 

X 

3 

- 

6  35 

89317 

Similar  to  protein  disulfide  isomerase-related  protein 

34869214 

X 

4 

2 

6  35 

89317 

Similar  to  PTB-associated  splicing  factor 

34871066 

X 

6 

10 

8  94 

65137 

Similar  to  PTPL1 -associated  RhoGAP  1 

34862742 

X 

6 

- 

5  83 

128868 

Similar  to  Rab8-lnteracting  Protein 

34861505 

X 

3 

- 

6  06 

91387 

Similar  to  rho/rac  guanine  nucleotide  exchange  factor  ,GEF 

27693092 

X 

4 

- 

5.39 

72437 

Similar  to  ribosome  binding  protein  1 

34858829 

X 

8 

9 

9  00 

143328 

Similar  to  RIKEN  cDNA  1190003A07 

34861128 

X 

3 

1 

5.46 

156956 

Similar  to  RIKEN  cDNA  1 50001 9M23 

34932774 

X 

3 

2 

899 

20550 

Similar  to  RIKEN  cDNA  181001 3P09 

34855823 

X 

3 

- 

8  66 

38164 

Similar  to  RIKEN  cDNA  3010001 A07 

34868074 

X 

4 

1 

6  92 

82148 

Similar  to  RNA  helicase  (DEAD-box  protein  03) 

34933204 

X 

7 

13 

618 

69437 

Similar  to  Schwachman-Bodian-Diamond  syndrome  protein 

27664348 

X 

4 

1 

891 

28753 

Similar  to  selective  hybridizing  clone  (SHYC) 

34856144 

X 

9 

- 

6  89 

164053 

Similar  to  SET  binding  factor  1 

34867669 

X 

3 

1 

8.41 

196456 

Similar  to  slingshot  1 

34872826 

X 

4 

1 

6  36 

178058 

Similar  to  small  nuclear  nbonudeoprotem  associated  protein  N 

27687269 

Q63747 

6 

3 

11.31 

124746 

Similar  to  small  nuclear  ribonudeoprotem  Sm  D2 

27676406 

X 

3 

3 

992 

13527 

Similar  to  small  nuclear  ribonudeoprotem  Sm  D2 

27676406 

X 

3 

3 

9  92 

13527 

Similar  to  small  nuclear  ribonudeoprotem  U1A 

34855379 

X 

4 

- 

6  40 

15295 

Similar  to  splicing  fador  (CC1  3) 

34860256 

X 

4 

- 

10.23 

68157 

Similar  to  splidng  fador  PRP8 

34872765 

X 

3 

- 

8  86 

304438 

Similar  to  splicing  factor  SC35 

27691222 

X 

3 

- 

11  86 

25476 

Similar  to  splicing  fador  U2AF  65  kDa  subunit 

34854490 

X 

9 

- 

498 

28114 

Similar  to  sulfide  qumone  reductase 

34856819 

X 

5 

5 

9.02 

69000 

Similar  to  SWI/SNF  complex  170  kDa  subunit 

34862210 

X 

5 

2 

5  28 

126799 

Similar  to  talin  2 

34864297 

X 

77 

37 

547 

284529 
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Similar  to  Tend  protein 

34868665 

X 

3 

- 

863 

164794 

Similar  to  TMP2 

34862084 

X 

3 

- 

7  73 

163993 

1 

Similar  to  transcription  factor  BTF3 

27686915 

X 

8 

3 

685 

17699 

Similar  to  transcription  factor  CA 150 

34879270 

X 

10 

- 

880 

123020 

2 

Similar  to  Transcnption  Initiation  Factor  HE.  Alpha  Subunit 

27666810 

X 

3 

1 

472 

49229 

Similar  to  transcnption  termination  factor  1  interacting  peptide  20 

34854655 

X 

3 

869 

42723 

Similar  to  transcnptional  activator  Pur-alpha 

34878862 

X 

4 

2 

6  07 

34884 

Similar  to  tyrosyl-tRNA  hgase 

34871588 

X 

7 

1 

861 

63026 

Similar  to  U2  small  nuclear  nbonucleoprotein  B 

34858820 

X 

3 

9  72 

25352 

Similar  to  vinculin 

34868946 

X 

12 

7 

5  52 

135513 

Similar  to  WD  repeat  endosomal  protein 

34866638 

X 

3 

- 

7  05 

84698 

Similar  to  Werner  syndrome  helicase  homolog 

34878743 

X 

3 

- 

531 

99948 

Similar  to  zinc  finger  protein  Cezanne 

34858073 

X 

3 

2 

685 

150318 

Similar  to  zinc  finger  transcnption  factor  ZNF207 

34872895 

X 

4 

- 

920 

52779 

TEMO  (novel  testicular  molecule) 

9954445 

Q9ESY1 

3 

- 

671 

206589 

Thyroid  hormone  receptor  interactor  10  (salt  tolerant  protein) 

16758800 

P97531 

4 

3 

523 

62788 

Ribosomal  proteins 

Protein  name 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

Number  of 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

predicted 

MW  Function  TMs 

ABC50 

10863747 

Q9ERQ2 

4 

6  79 

92749 

Acidic  ribosomal  phosphoprotein  PO 

11693176 

PI 9945 

4 

- 

591 

34215 

Elongation  factor  1-alpha  1 

1220484 

Q64718 

31 

20 

9.10 

50150 

Elongation  factor  1 -alpha  2 

206440 

P27706 

5 

1 

9.11 

50454 

Eukaryotic  translation  initiation  factor  1.  alpha  subunit 

1352425 

P47813 

3 

491 

46416 

Eukaryotic  translation  initiation  factor  2.  alpha  subunit 

325 

P05199 

9 

2 

5.02 

35977 

Eukaryotic  translation  initiation  factor  2.  beta  subunit 

27703942 

X 

7 

1 

5.61 

38243 

Eukaryotic  translation  initiation  factor  2.  gamma  subunit 

27663888 

X 

10 

3 

9.51 

17740 

Eukaryotic  translation  initiation  factor  5 

585303 

Q07205 

5 

1 

5  36 

48954 

Ribosomal  proteins  large  subunit  family 

X 

X 

X 

X 

basic 

X 

Ribosomal  proteins  small  subunit  family 

X 

X 

X 

X 

basic 

X 

Ribosomal  S6  kinase  alpha  1 

13592065 

063531 

3 

- 

797 

82883 

Endoplasmic  reticulum  proteins 


Number  of 


NCBI 


SwissProt  or 


MS/MS  spectra 


predicted 


Protein  name 

locus 

Trembl  locus 

Lung  P 

RLMVEC 

PI 

MW  Function 

TMs 

78  KD  Glucose-regulated  protein 

121574 

P06761 

10 

50 

5.07 

72347 

Calnexin 

543922 

P35565 

7 

23 

4  49 

67255 

1 

Calrebculm 

55855 

P18418 

10 

15 

433 

47995 

Cytochrome  b5.  ER 

231928 

P00173 

3 

1 

4  90 

15224 

Cytochrome  P450  2B1 

223633 

P001 76 

3 

- 

700 

55934 

1 

Flavin-containing  monooxygenase  1 

204152 

P36365 

5 

- 

866 

59781 

1 

Heat  shock  protein  47  kDa 

8393057 

P29457 

6 

15 

8  88 

46518 

NADH-cytochrome  b5  reductase 

127847 

P20070 

7 

3 

8  57 

34043 

NADPH-cytochrome  P450  reductase 

3318958 

POO 388 

5 

5 

5.30 

76832 

1 

Peptidyl-prolyl  cis-trans  isomerase  B 

2143900 

P24368 

19 

15 

930 

23025 

Prostaglandin  F2  receptor  negative  regulator 

2497303 

Q62786 

3 

1 

6  16 

98731 

1 

Protein  disulfide  isomerase  A3 

56905 

PI 1598 

21 

36 

588 

56623 

Protein  disulfide  isomerase  A6 

2501206 

Q63081 

3 

15 

495 

47220 

Ribophonn  1 

132560 

P07153 

4 

16 

605 

68304 

1 

Ribophonn  II 

132562 

P25235 

4 

5 

5.72 

69061 

3 
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Durr  et  al 

Mitochondrial  proteins 


Protein  name 


Acetoacetyl-CoA  thiolase 
Acetyl-CoA  acyltransferase 
Aldehyde  dehydrogenase,  mitochondhal 
ATP  synthase  CF(O)  -  B  chain,  mitochondhal 
A  TP  synthase  CF(O)  -  D  chain,  mitochondnal 
ATP  synthase  CF(1)  -  alpha  chain,  mitochondnal 
ATP  synthase  CF(1)  -  beta  chain,  mitochondhal 
Cytochrome  65.  mitochondnal 
Cytochrome  c.  somatic 
Glutamate  dehydrogenase 
Heat  shock  40  kDa  protein  4 
NADP+-spectfic  isocitrate  dehydrogenase 
PDC-E2 

Succmyi-CoA  synthetase,  alpha  chain 
Thfunctional  enzyme,  alpha  subunit 
Tnfunctional  enzyme,  beta  subunit 
Ubiquinol-cytochrome  C  reductase  complex  core  protein  2 


Nuclear  proteins 


Protein  name 


2‘, 3' -cyclic  nucleotide  3'-phosphodiesterase 

Centrosomal  protein  2 

DNA  mismatch  repair  protein  Msh2 

DNA  topoisomerase  I 

F BP-mteractmg  repressor 

G10  protein  homolog  (edg-2) 

High  mobility  group  protein  1 
High  mobility  group  protein  2 
Histones 

La  Ribonudeoprotem 

Lamm  A 

Lamm  B1 

Lamm  C2 

Nudeotin 

Nudeophosmm 

Poly  [ADP-hbose]  polymerase- 1  (PARP- 1) 

Pynmidine  binding  protein  1 

Sphceosomal  protein  SAP155 

Strudural  maintenance  of  chromosome  3 

Thymopoietin  beta  (LAMINA-associated  polypeptide  2) 

Transchption  fador  MTSG1 

Transcnption  initiation  factor  RAP30 


Number  of 


NCBI 

locus 

SwissProt  or 

Trembl  locus 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW  Function 

predicted 

TMs 

135757 

PI 7764 

4 

3 

8  92 

44695 

135762 

PI 3437 

3 

3 

809 

41871 

118505 

P1 1884 

3 

4 

663 

56488 

114625 

P19511 

3 

6 

9  39 

28869 

220904 

P31399 

4 

8 

621 

18632 

6729934 

PI 5999 

14 

22 

922 

58826 

1374715 

P10719 

17 

23 

518 

56354 

2253161 

P04166 

3 

1 

4  92 

16265 

118008 

P00009 

4 

6 

9.61 

11474 

6980956 

PI 0860 

12 

13 

8  05 

61428 

1706475 

P54102 

4 

3 

665 

44868 

13928690 

X 

4 

7 

6  53 

46734 

266685 

P08461 

3 

3 

5  70 

58764 

135025 

PI  3086 

4 

9 

954 

35032 

510108 

Q64428 

8 

14 

9.11 

82513 

543387 

Q60587 

5 

3 

950 

51414 

418146 

P32551 

3 

6 

9  16 

48373 

NCBI 

locus 

SwissProt  or 

Trembl  locus 

Number  of 

MS/MS  spectra 

Lung  P  RLMVEC 

PI 

MW  Function 

predicted 

TMs 

294527 

Q64575 

4 

903 

47268 

27704348 

Q9BV73 

3 

2 

498 

208800 

1709122 

P54275 

3 

1 

5.77 

104028 

4809197 

X 

5 

1 

935 

90760 

5524727 

Q9NZA0 

6 

1 

525 

58171 

3064070 

070454 

6 

- 

9.15 

17069 

1708258 

P07155 

24 

8 

5  70 

24894 

1708260 

P52925 

20 

1 

7  10 

24028 

X 

X 

X 

X 

X 

X 

729919 

P38656 

4 

4 

9.47 

47777 

1346413 

P48679 

25 

53 

654 

74324 

1575794 

P70615 

9 

16 

5  16 

66475 

1556433 

P48679 

21 

42 

654 

74324 

92559 

PI 3383 

3 

- 

467 

77016 

114763 

PI 3084 

3 

7 

462 

32560 

3123251 

P27008 

10 

1 

9  07 

112529 

13487910 

Q00438 

3 

6 

9  14 

56937 

9885342 

Q9ET34 

3 

- 

5.67 

54496 

29336525 

P97690 

3 

1 

782 

138448 

6981660 

062733 

4 

4 

940 

50278 

1 

30017413 

Q80299 

3 

1 

7.20 

50728 

220891 

Q63489 

5 

1 

924 

28380 
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Endothelial  cell  associated  marker  proteins 


SwissProt  or 


Number  of 
MS/MS  spectra 


predicted 


Protein  name 

NCBI  locus 

Trembl  locus 

Species 

Rat  Lung 

RLMVEC 

TMs 

5'-nucleotidase  (CD73) 

112826 

P21588 

R 

10 

1 

Adrenergic  receptor  alpha  2B 

2245628 

X 

H 

1 

- 

7 

Alpha-2  macroglobulm 

112914 

P28666 

M 

1 

- 

0 

Ammopeptidase  N  (CD13) 

2499898 

P97449 

R 

3 

4 

1 

Angrotensin  II  receptor  (1  or  2) 

13591910 

R 

1 

1 

7 

Angiotensin-converting  enzyme  (CD134) 

11493661 

P12821 

R 

44 

- 

1 

Annexin  IV 

1703320 

P55260 

R 

14 

. 

0 

Annexm  V 

4033508 

PI 4668 

R 

2 

2 

0 

APC  protein 

114033 

P25054 

H 

2 

- 

0 

Aquaporin-CHIP 

267413 

P29975 

R 

3 

- 

6 

Bradykinin  receptor  B2 

456684 

R 

- 

1 

7 

Carbonic  anhydrase  IV 

544726 

P48284 

R 

11 

- 

0 

Caveotin-1 

17017233 

Q8V1K9 

R 

13 

5 

1 

Chemokine  receptor  CCX  CKR 

19879587 

M 

1 

. 

7 

Opeptidyl  peptidase  IV  (CD26) 

118905 

PI 4740 

R 

31 

- 

1 

Ecto-apyrase  (CD39) 

2499219 

P55772 

M 

1 

- 

2 

EDG-1 

1352343 

P48303 

R 

1 

- 

7 

EDG-2 

729546 

P41223 

H 

3 

- 

0 

EGF 

181980 

X 

H 

1 

- 

1 

EGF 

1071851 

X 

M 

- 

2 

2 

Endomudn 

34860660 

X 

R 

1 

- 

1 

Endothelial  actin-bmding  protein 

17486458 

P21333 

H 

61 

81 

0 

Endothelial  cell-selective  adhesion  molecule 

13991773 

Q925F2 

M 

5 

- 

1 

Endothelial  collagen 

17738302 

P27658 

H 

3 

0 

Endothelial  differentiation-related  factor  1 

4503453 

060869 

H 

2 

- 

0 

Endothelial  plasminogen  activator  inhibitor 

129578 

P20961 

R 

0 

4 

0 

Endothelm  converting  enzyme 

529085 

P42893 

R 

7 

1 

H-CAM  (CD44) 

19923703 

008779 

R 

2 

4 

1 

Integnn  alpha  V 

1170592 

P43406 

M 

2 

5 

1 

Integhn  alpha- 1  (CD49a) 

124941 

P18614 

R 

8 

18 

2 

Integnn  alpha-3  (CD49c) 

3183040 

Q62470 

M 

7 

- 

2 

Integnn  alpha-5  (CD49e) 

1708569 

P1 1688 

M 

4 

6 

1 

Integnn  beta-1  (CD29) 

.  520566 

P49134 

R 

11 

10 

1 

Intercellular  adhesion  molecule-1  (I-CAM1;  CD54) 

124100 

Q 00238 

R 

6 

10 

2 

Intercellular  adhesion  molecule-2  ( I-CAM2 ;  CD102) 

462381 

P35330 

M 

3 

- 

1 

MAC-mhibitor  (CD59) 

2507508 

P27274 

R 

1 

- 

2 

MDR  1A 

266517 

P21447 

R 

1 

9 

11 

MECA32 

14161394 

X 

M 

2 

1 

1 

Microvascular  endothelial  differentiation  gene  1 

10732861 

P97554 

R 

1 

1 

MRP-1  (CD9) 

729088 

P40241 

R 

3 

2 

4 

MUC18 

13095926 

X 

R 

5 

- 

1 

Muscahntc  acetylcholine  receptor  M3 

92492 

X 

R 

1 

1 

7 

Na+K+  transporting  ATPase  alpha  1 

18204493 

P06685 

M 

20 

18 

10 

Nicotinic  acetylcholine  receptor  alpha  3 

19073315 

M 

1 

- 

4 

Nicotinic  acetylcholine  receptor  beta  3 

113108 

PI 2391 

R 

1 

- 

4 

Nitnc-oxide  synthase 

266646 

P29476 

R 

1 

1 

0 

P2Y  punnoceptor  6 

2495019 

R 

- 

1 

6 

PAR-1  B  alpha 

15042611 

X 

H 

1 

- 

0 

Platelet  endothelial  tetraspan  antigen-3 

11968106 

Q9QZA6 

R 

1 

3 

4 

Plateiet-dehved  growth  factor  receptor 

129891 

P05622 

M 

1 

1 

1 

Platelet-endothelial  cell  adhesion  molecule-1  (CD31) 

1684839 

X 

R 

4 

- 

1 

Podocalyxin 

4996222 

X 

R 

5 

- 

1 

Puhnergic  receptor  5 

34223726 

P43657 

H 

1 

- 

7 

PV-1 

5281517 

Q9WV78 

R 

21 

- 

1 

RAGE 

2497319 

Q63495 

R 

11 

- 

1 

Scavenger  receptor  (CD36) 

3243055 

X 

R 

1 

- 

2 

Scavenger  receptor  class  B  type  1 

4210542 

X 

R 

1 

1 

2 

Scavenger  receptor  class  F 

4507203 

X 

H 

1 

- 

0 

Sialomuan  (CD34) 

495716 

064314 

M 

1 

- 

1 

Thrombomodulin 

13929084 

X 

R 

6 

- 

1 

Tight  junction  protein  1 -Z01 

303710 

P39447 

M 

26 

- 

0 

Tight  junction  protein  2  -  Z02 

1839162 

Q9UDY2 

R 

15 

- 

0 

Transfemn  receptor  (CD71) 

136378 

P02786 

H 

2 

1 

1 

Tumor  endothelial  marker  4  (TEM4) 

15987489 

Q96PFE 

H 

2 

- 

0 

Tumor  endothelial  marker  6  (TEM6,  Tensin  3) 

17511209 

Q96PE0 

H 

2 

1 

0 

Tyrosine-protein  kinase  receptor  TIE-2 

34869831 

Q9QW24 

R 

5 

- 

1 

Vascular  adhesion  protein- 1 

5902787 

070423 

M 

2 

- 

1 

Vascular  cell  adhesion  protein  1 

267284 

P29534 

R 

- 

8 

1 

Vascular  endothelial  cell  specific  protein  1 1 

13928415 

X 

R 

14 

16 

0 

Vascular  endothelial  junction-associated  molecule 

10720348 

P57087 

H 

2 

- 

1 

Vascular  endothelial-cadhenn  1 

1150514 

P55284 

M 

7 

- 

2 

Vascular  endothelial-cadhenn  2 

8164037 

Q9NPG4 

H 

2 

1 

1 

Von  Willebrand  factor 

4467984 

P04275 

R 

12 

- 

0 
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Subtractive  proteomic  mapping  of  the 
endothelial  surface  in  lung  and  solid 
tumours  for  tissue-specific  therapy 

Phil  Oh,  Yan  U,  Jingyl  Yu,  Eberhard  Durr,  Karolina  M.  Krasinska,  Lucy  A.  Carver,  Jacqueline  E.  Testa  &  Jan  E.  Schnltzer 

Sidney  Kimmel  Cancer  Center,  10835  Altman  Row,  San  Diego,  California  92121,  USA 


The  molecular  complexity  of  tissues  and  the  inaccessibility  of  most  cells  within  a  tissue  limit  the  discovery  of  key  targets  for 
tissue-specific  delivery  of  therapeutic  and  imaging  agents  in  vivo.  Here,  we  describe  a  hypothesis-driven,  systems  biology 
approach  to  identifying  a  small  subset  of  proteins  induced  at  the  tissue-blood  interface  that  are  inherently  accessible  to  antibodies 
injected  intravenously.  We  use  subcellular  fractionation,  subtractive  proteomics  and  bioinformatics  to  identify  endothelial  cell 
surface  proteins  exhibiting  restricted  tissue  distribution  and  apparent  tissue  modulation.  Expression  profiling  and  ^-scintigraphic 
imaging  with  antibodies  establishes  two  of  these  proteins,  aminopeptidase-P  and  annexin  A1,  as  selective  in  vivo  targets  for 
antibodies  in  lungs  and  solid  tumours,  respectively.  Radio-immunotherapy  to  annexin  A1  destroys  tumours  and  increases  animal 
survival.  This  analytical  strategy  can  map  tissue-  and  disease-specific  expression  of  endothelial  cell  surface  proteins  to  uncover 
novel  accessible  targets  useful  for  imaging  and  therapy. 


New  targets  are  needed  for  detecting  disease  through  molecular 
imaging1^1  and  for  treating  disease  through  directed  delivery  in 
vivo*"*.  Sequencing  the  human  genome  has  identified  a  target  pool 
of  40,000  genes  which  may  translate  into  a  million  possible  protein 
targets10.  Genomic  and  proteomic  analysis  of  normal  and  diseased 
tissues  has  yielded  thousands  of  genes  and  gene  products  for 
diagnostic  and  tissue  assignment  as  well  as  potential  therapeutic 
targets5-11.  But  the  shear  number  of  candidates  can  overwhelm 
the  required  in  vivo  validation  process,  leading  some  to  question 
the  ultimate  impact  of  these  approaches  on  speeding  up  drug 
discovery5-7,10.  Reducing  tissue  data  complexity  to  a  manageable 
subset  of  candidates  most  relevant  to  targeting,  imaging  and 
treating  disease  is  clearly  desired  but  requires  new  discovery  and 
validation  strategies,  that  effectively  focus  the  power  of  global 
identification  technologies. 

Selectively  targeting  a  single  organ  or  diseased  tissue  (such  as  a 
solid  tumour)  in  vivo  remains  a  desirable  but  elusive  goal  for 
molecular  medicine12,13.  This  targeting  would  allow  more  effective 
imaging  as  well  as  drug  and  gene  therapies  for  many  acquired 
and  genetic  diseases1"4,6,7,10"13.  Most  tissue-  and  disease-associated 
proteins  are  expressed  by  cells  inside  tissue  compartments  not 
readily  accessible  to  intravenously  injected  biological  agents  such 
as  antibodies.  This  inaccessibility  hinders  many  site-directed  thera¬ 
pies5-7,12,13  and  imaging  agents1^1.  For  example,  multiple  barriers  to 
delivery  prevent  effective  immunotherapy  for  solid  tumours  in  vivo , 
despite  efficacy  and  specificity  in  vitro12-17.  Conversely,  the  nearly 
universal  access  of  chemotherapeutics  dilutes  efficacy  requiring 
increased  dosages,  leading  to  unwanted  systemic  side  effects. 
Thus,  new  approaches  are  required  to  filter  the  overabundance  of 
molecular  information  to  allow  rapid  discovery  and  validation  of 
accessible  tissue-specific  targets  that  can  direct  molecular  imaging 
and  pharmacodelivery  in  vivo. 

Here,  we  divert  analytical  focus  from  the  poorly  accessible  cells 
inside  tissues  to  the  critical  tissue-blood  interface,  inherently 
accessible  to  agents  absorbed  or  injected  into  the  blood.  The 
endothelium  is  a  minor  tissue  component  that  exists  as  an  attenu¬ 
ated  cell  monolayer  lining  all  blood  vessels  and  is  in  direct  contact 
with  the  circulating  blood.  The  tissue  microenvironment  surround¬ 
ing  blood  vessels  seems  to  control  the  endothelial  cell  phenotype 


in  vivo18-22.  But  the  degree  to  which  normal  and  diseased  tissues  can 
modulate  endothelial  cell  surface  protein  expression  is  unknown. 
Indirect  evidence  of  endothelial  cell  molecular  heterogeneity  comes 
from  the  reported  ability  of  certain  cells  and  peptide  sequences  to 
home  to  specific  tissues  after  intravenous  injection23-25.  Several 
receptors  to  angiogenic  factors  have  been  identified26-28  and  geno¬ 
mic  analysis  of  endothelial  cells  isolated  from  tumours  has  provided 
angiogenic  markers  of  unknown  subcellular  localization2*.  Geneti¬ 
cally  induced  endothelial  cell  targets  provide  ‘proof  of  principle’ 
that  immuno-targeting  endothelium  can  cause  tumour 
regression30;  however,  few  probes  to  native  endothelial  cell  surface 
proteins  show  validated  tissue-specific  pharmacodelivery 
in  Wvo14,15,17,24,31,32.  This  paucity  of  molecular  information  may 
reflect  the  lack  of  tissue-induced  proteins  or,  perhaps  more  likely, 
the  difficulty  in  studying  such  a  minor  cell  type  as  it  exists  natively  in 
the  tissue  without  tissue  disruption  and  isolation.  Although  gene 
and  immune  targeting  of  tumour  neovasculature  seems  promis¬ 
ing33,34,  the  ultimate  potential  of  endothelial  targeting  in  creating 
new  drug  delivery  systems  remains  unclear16,17.  The  actual  existence 
of  tissue-specific  endothelial  cell  surface  targets  will  remain  uncer¬ 
tain  until  we  overcome  current  technical  barriers  to  comprehen¬ 
sively  map  and  validate  tissue-induced  endothelial  cell  surface 
proteins  in  vivo. 

Molecular  diversity  of  endothelium  in  vivo 

To  investigate  endothelial  cell  surface  heterogeneity  in  vivo ,  we 
performed  subcellular  fractionation  to  isolate  silica-coated  luminal 
endothelial  cell  plasma  membranes  and  their  caveolae  directly  from 
normal  organs35,36.  These  plasma  membranes  and  their  caveolae 
displayed  a  greater  than  20-fold  enrichment  for  endothelial  cell 
surface  or  caveolar  markers  (angiotensin  converting  enzyme  (ACE), 
vascular  endothelial  (VE)-cadherin  and  caveolin-1),  whereas  mar¬ 
kers  of  intracellular  organelles  (for  example,  the  Golgi  protein  @- 
COP),  other  tissue  cells  (for  example,  E-cadherin  for  epithelium, 
fibroblast  surface  protein)  and  blood  (for  example,  glycophorin  A, 
CD4  and  CD11)  were  greater  than  20-fold  depleted  (Fig.  lc;  data 
not  shown  and  as  shown  previously1*35,36).  This  quality  control 
analysis  was  applied  to  each  isolate. 

We  analysed  these  isolated  plasma  membranes  by  two- 
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Figure  1  Molecular  heterogeneity  of  endothelial  cell  surface  in  multiple  rat  organs  in  vivo. 
a,  Silver-stained  2D  gels  of  proteins  from  the  whole  tissue  homogenates  (H)  and  the 
isolated  luminal  cell  endothelial  plasma  membranes  (P).  b.  Comparison  of  representative 
region  of  2D  gels  of  endothelial  plasma  membrane  from  the  indicated  rat  tissues.  Green, 
protein  spots  found  in  >2  tissues;  Red,  spots  not  detected  elsewhere,  c,  Western  blot 
analysis  of  proteins  from  endothelial  plasma  membranes  (1 0  ng)  from  multiple  rat  organs 


using  the  antibodies  to  the  indicated  proteins.  ACE,  angiotensin  converting  enzyme;  APP, 
aminopeptidase  P;  AQP1 ,  aquaporin  1 ;  CA,  carbonic  anhydrase;  DPPIV,  dipeptidyl 
peptidase  IV;  ECE,  endothelin  converting  enzyme;  RAGE,  receptor  for  advanced  glycation 
end  products;  STR,  seven  transmembrane  receptor;  TM,  thrombomodulin.  3 -COP. 
coatomer  beta  subunit. 


dimensional  (2D)  gel  electrophoresis  to  produce  high-resolution 
vascular  endothelial  protein  maps  of  the  major  rat  organs,  that  were 
distinct  and  much  reduced  in  complexity  from  that  of  the  starting 
tissue  homogenate  (Fig.  la).  Differential  spot  analysis  detected 
many  distinct  proteins  in  plasma  membrane  versus  the  homogenate 
and  also  in  the  plasma  membrane  between  organs  (Fig.  lb).  Western 
blot  analysis  further  confirmed  this  heterogeneity  (Fig.  lc;  data  not 
shown).  In  many  cases,  antigens  difficult  to  detect  in  tissue  homo¬ 
genates  were  readily  apparent  in  the  isolated  plasma  membranes, 
reflecting  the  significant  enrichment  and  increased  sensitivity  pro¬ 
vided  by  subfractionation  to  unmask  proteins  located  on  the 
endothelial  cell  surface.  Unique  ‘molecular  fingerprints  or  signa¬ 
tures’  that  may  include  tissue-  and  cell-specific  proteins  were 
apparent  for  each  endothelium. 

Subtractive  analysis  and  profiling 

To  identify  specific  proteins  expressed  at  the  endothelial  cell  surface 
we  used  mass  spectrometry,  database  searching  and  immuno- 
blotting,  to  analyse  endothelial  plasma  membranes  and  caveolae 
isolated  from  rat  tissues35,36,  as  well  as  cultured  rat  lung  micro- 
vascular  endothelial  cells  (RLMVEC),  isolated  and  grown  in  cul¬ 
ture37.  So  far,  we  have  identified  nearly  2,000  non-redundant 
proteins  (our  unpublished  observation)  which  were  entered  into  a 
database  (Accessible  Vascular  Targets,  AVATAR)  that  was  designed 
and  annotated  to  provide  relational  analysis  between  measurements 
and  samples.  To  identify  lung-induced,  and  possibly,  lung-specific 
endothelial  cell  surface  proteins  we  used  a  systems  biology 
approach,  based  on  the  hypothesis  that  the  surrounding  tissue 
microenvironment  (normal  or  diseased)  modulates  protein 
expression  in  the  vascular  endothelium.  We  subtracted,  in  silicoy 
the  proteins  identified  in  endothelial  plasma  membranes  from  rat 
lungs  versus  RLMVEC  because  tissue-dependent  protein  expression 
may  be  reduced  in  cell  culture  that  cannot  yet  duplicate  conditions 
in  vivo.  To  increase  certainty  of  expression  in  vivo  versus  in  vitro ,  we 
applied  a  strict  identification  criterion  of  ^6  tandem  mass  spec¬ 
trometry  (MS/MS)  spectra  to  rat  lung  endothelial  plasma  mem¬ 
branes  versus  low  stringency  of  ^  1  MS/MS  spectra  to  RLMVEC 
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Figure  2  In  wVo  -y  -scintigraphic  imaging  of  APP  antibody  targeting  in  normal  and 
tumour-bearing  rats.  a.  b,  Planar  images  collected  20  min  after  an  intravenous  injection 
of  125I-APP  monoclonal  antibody  into  normal  female  rats  (a)  and  female  rats  bearing 
metastatic  breast  adenocarcinoma  in  the  lung  (b).  c,  d,  SPECT  imaging  using  a  parallel 
hole  collimator  of  female  normal  rats  (c)  or  rats  bearing  breast  adenocarcinoma  tumours 
in  their  lungs  (d)  20  min  after  tail  vein  injection  with  125I-APP  antibodies  (50  jiCi  of  IgG  at 
lOCig-1).  Representative  re-projected  frames  at  the  indicated  degree  of  rotation  are 
shown.  Arrowhead  in  (a)  denotes  heart  cavity. 
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plasma  membranes.  From  the  current  rat  database  of  approximately 
40,000  entries,  we  identified  37  differentially  expressed  proteins.  To 
meet  our  objective  of  direct  exposure  to  circulating  antibodies,  we 
applied  bioinformatic  interrogation  of  structure,  glycosylation  and 
membrane  orientation  to  reduce  the  list  to  1 1  proteins  likely  to  have 
domains  outside  the  cell. 

To  confirm  the  original  mass  spectrometry  findings  and  to  assess 
organ-specificity,  we  performed  expression  profiling  with  specific 
antibodies.  Consistent  with  our  hypothesis,  western  blot  analysis 
detected  all  1 1  proteins  enriched  in  lung  endothelial  plasma 
membranes  (Fig.  lc)  but  not  in  RLMVEC  plasma  membranes 
(data  not  shown).  Each  of  these  proteins  exhibited  a  different 
pattern  of  restricted  expression.  Two  proteins,  aminopeptidase  P 
(APP)  and  OX-45,  were  detected  only  in  endothelial  plasma 
membranes  from  lung.  Staining  of  tissue  sections  with  antibodies 
(including  to  APP  and  OX-45)  confirmed  this  restricted  expression 
(data  not  shown).  This  profiling  by  western  blot  analysis  was  very 
sensitive  and  reliable,  even  at  low  expression  levels.  Thus  subtractive 


proteomic  analysis  provided  a  subset  of  endothelial  cell  proteins  not 
expressed  in  vitro  and  differentially  expressed  in  vivo ,  apparently 
regulated  by  the  unique  tissue  microenvironment  in  e^ch  organ. 

Lung-specific  targeting  in  vivo 

To  assess  the  immuno-accessibility  of  APP  and  OX-45  in  vivo, 
including  possible  lung-specific  targeting,  we  intravenously  injected 
12 1 -labelled  monoclonal  antibodies  into  rats  and  performed  whole- 
body  imaging  using  planar  'y -scintigraphy.  Within  20  min,  we 
observed  clear  lung  images,  indicating  rapid  and  specific  targeting 
of  APP  antibody  to  the  lung  (Fig.  2a,  c).  Region-of- interest  and 
biodistribution  analysis  confirmed  this  lung-specific  targeting 
with  65%  of  the  injected  dose  per  gram  of  tissue  accumulating 
in  the  lungs  and  less  than  2%  injected  dose  per  gram  of  tissue  in 
other  organs  and  blood.  Consistent  with  the  low  blood  levels,  the 
heart  cavity  was  readily  apparent  (Fig.  2a,  arrowhead).  Further¬ 
more,  this  lung  targeting  was  inhibited  by  addition  of  a  50-fold 
excess  of  unlabelled  APP  IgG  but  not  with  control  IgG  (data  not 
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Figure  3  Mapping  rat  lung  tumours  and  organs  for  differential  endothelial  cell  surface 
protein  expression  in  vivo.  a.  Image  of  representative  lungs  with  multiple  solid  tumours, 
b,  2D  gel  analysis  of  silica-coated  luminal  endothelial  plasma  membranes  isolated  from 
breast  adenocarcinoma  tumour-bearing  lungs  of  three  different  animals,  c,  2D  gel 
analysis  of  endothelial  plasma  membranes  from  normal  rat  lungs  (left)  and  lungs  bearing 
breast  adenocarcinoma  tumours  (right).  A  colour  difference  map  of  four  small  gel 
regions  between  normal  and  tumour  endothelial  plasma  membrane  proteins  shows 
relative  abundance  of  protein  spots.  Green,  tumour  endothelial  plasma 
membranes  >  normal  endothelial  plasma  membranes.  Yellow,  tumour  endothelial 
plasma  membranes  <  normal  endothelial  plasma  membranes.  Red.  only  in  the  tumour 
endothelial  plasma  membranes.  Red-edged,  tumour  endothelial  plasma 


membranes  =  normal  endothelial  plasma  membranes,  d.  2D  gel  analysis  of  tumour 
homogenate  (H)  (left)  and  endothelial  plasma  membranes  (P)  (right).  Yellow,  protein  spots 
enriched  in  the  homogenate  over  plasma  membranes.  Green,  protein  spots  enriched  in 
plasma  membrane  over  homogenate.  Blue,  detected  only  in  tumour  endothelial  plasma 
membranes  and  not  homogenate,  e,  Western  blot  analysis  of  proteins  (1 0  ^g)  from  rat 
lung  homogenate  (H)  and  endothelial  plasma  membranes  (P)  from  normal  organs  or 
tumour-bearing  rat  lungs  using  antibodies  to  the  indicated  proteins.  RBC,  red  blood  cell; 
WBC,  white  blood  cell;  M4>.  macrophage;  5  NT,  5'  nucleotidase;  GlyA,  glycophorin  A; 
FSP,  fibroblast  surface  protein;  VEGF  R,  vascular  endothelial  growth  factor  receptor;  APN, 
aminopeptidase  N;  MPO.  myeloperoxidase;  TfnR,  transferrin  receptor;  VitDBP,  vitamin  D 
binding  protein. 
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Figure  4  Tumour-specific  targeting  of  125l-AnnA1  antibodies  after  intravenous  injection 
into  tumour-bearing  rats,  a,  Whole-body  planar  -y  -scintigraphic  imaging  4  h  after  tail 
vein  injection  of  125l-AnnA1  antibodies  (50fiCi  of  IgG  at  lOCig^1)  into  lung-tumour- 
bearing  female  rats,  b,  125l-AnnA1  antibody  signal  superimposed  onto  photo  of 
experimental  animal  lying  on  the  detector  plate,  c.  d.  Tumour-bearing  lungs  were  excised 
for  imaging  ex  vivo.  Digital  image  of  excised  lungs  showing  location  of  tumours,  circled  in 
yellow  (c).  Overlay  of  planar  images  of  tumour  hot  spots  with  excised  tumour-bearing 
lungs  (d). 


shown),  showing  the  specific  ability  of  our  antibodies  to  bind  APP 
exposed  on  the  endothelial  cell  surface  of  the  lung  but  not  other 
organs. 

Consistent  with  OX-45  expression  in  various  leukocytes38,  we 
detected  OX-45  (but  not  APP)  in  blood.  Thus,  imaging  with  the 
OX-45  antibody  showed  little  targeting  to  the  lung  (data  not 
shown).  Biodistribution  analysis  confirmed  only  2%  of  the 
injected  dose  per  gram  of  tissue  accumulating  in  the  lung  and 
greater  than  70%  of  the  injected  dose  per  gram  of  tissue  remaining 
in  the  blood,  primarily  bound  to  the  buffy  coat  subfraction  of  the 
blood. 

APP  was  reported  to  be  expressed  on  mouse  blood  vessels  of 
normal  breast  and  mammary  adenocarcinomas  as  a  target  for  a 
homing  peptide39.  High  resolution  single  photon  emission  com¬ 
puted  tomography  (SPECT)  imaging  of  I25I-APP  antibodies 
injected  intravenously  into  female  rats  detected  no  accumulation 
in  normal  breast  tissue  (Fig.  2a)  or  in  primary  or  metastatic  breast 
tumour  lesions  (Fig.  2b;  and  multiple  images  taken  over  36  hours 
(data  not  shown)).  SPECT  imaging  provided  a  clear  three-dimen¬ 
sional  view  of  normal  lungs  (Fig.  2c)  which  was  in  striking  contrast 
to  the  irregular  images,  with  multiple  nodular  regions  lacking 
signal,  in  rats  with  mammary  adenocarcinomas  in  the  lung 
(Fig.  2d).  Thus,  based  on  the  immuno-targeting  imaged  in  vivo> 
endothelial  APP  expression  in  the  rat  appeared  quite  specific  for 
normal  lung  tissue,  apparently  induced  by  conditions  present  in  the 
normal  lung  tissue  microenvironment,  but  absent  in  the  tumour 


milieu  in  vivo ,  even  when  the  tumour  blood  vessels  are  derived  from 
the  normal  lung  vasculature. 

Tumour-induced  endothelial  cell  proteins 

To  determine  whether  the  tumour  microenvironment  in  the  lung  is 
sufficiently  different  to  induce  new  endothelial  protein  expression, 
we  isolated  silica-coated  luminal  endothelial  cell  plasma  mem¬ 
branes  and  their  caveolae  from  normal  rat  lungs  and  lungs  bearing 
breast  adenocarcinomas33,36  (Fig.  3a).  As  above,  these  isolates  were 
significantly  enriched  relative  to  the  tissue  homogenates  in  endo¬ 
thelial  cell  surface  markers  (caveolin,  5'  nucleotidase,  ACE  and 
VE-cadherin)  and  markedly  depleted  in  markers  of  possible  con¬ 
taminants  (including  3 -COP,  CD4,  CD1 1,  glycophorin  A,  fibroblast 
surface  protein  and  galectin-1,  that  is  expressed  by  the  tumour 
cells40)  (Fig.  3e  and  data  not  shown).  The  endothelial  plasma 
membranes  isolated  from  tumours  expressed  multiple  angiogenesis 
markers  (enriched  relative  to  both  tumour  homogenates  and 
normal  lung  endothelial  cell  plasma  membranes).  Lastly,  markers 
of  immune  cells  known  to  infiltrate  solid  tumours  were  detected  in 
tumour  homogenates  but  not  its  endothelial  cell  plasma 
membranes. 

We  used  2D  gels  to  visualize  several  hundred  protein  spots  in 
endothelial  cell  plasma  membranes  from  normal  lungs  versus 
tumours  in  lungs.  These  maps  were  reproducible  (Fig.  3b).  Multiple 
protein  spots  were  detected  in  these  tumour  but  not  normal 
endothelial  plasma  membranes  (Fig.  3c).  Prominent  2D  spots  easily 
detected  in  tumour  plasma  membranes  were  not  detected  in  the 
homogenates  (Fig.  3d),  consistent  with  the  small  percentage  of 
endothelial  cell  plasma  membranes  in  the  tumours.  Tissue  sub¬ 
fractionation  appeared  necessary  to  unmask  differentially  expressed 
tumour  vascular  proteins  obscured  by  the  molecular  complexity  of 
the  total  tumour. 

Again,  we  applied  a  subtractive  proteomic  approach  using 
antibody  and  mass  spectrometric  analysis  of  these  plasma  mem¬ 
branes  and  caveolae  isolated  from  them  to  identify  15  differentially 
expressed  proteins  (Fig.  3e),  including  proteins  already  implicated 
in  tumour  angiogenesis:  vascular  endothelial  growth  factor  ( VEGF) 
receptors- 1  and  -2;  Tie2;  aminopeptidase-N;  endoglin;  carcino- 
embryonic  antigen-related  cell  adhesion  molecule  1  (CD66)' 
(C-CAM-1)  and  neuropilin-127,28.  These  proteins  were  enriched  in 
the  tumour  endothelial  plasma  membranes  relative  to  tumour 
homogenates,  consistent  with  proper  subfractionation.  Eight  new 
tumour- induced  vascular  proteins  were  also  identified:  annexin  Al; 
annexin  A8;  ephrin  A5;  ephrin  A7;  myeloperoxidase;  nucleolin; 
transferrin  receptor  and  vitamin  D-binding  protein.  Consistent 
with  the  subtractive  screen  hypothesis,  12  out  of  15  proteins  were 


Days  after  tumour  cell  Inoculation  Days  after  tumour  cell  inoculation 


Figure  5  AnnAI  targeted  radio-immunotherapy  increases  survival  in  rats.  Female  Fisher 
rats  were  injected  intravenously  with  1 3762  cells  to  induce  lung  tumours  (Day  0). 
a.  Kaplan-Meier  survival  curve  comparing  the  survival  of  tumour-bearing  rats  injected  1 5 
days  after  tumour  cell  inoculation  with  125l-AnnA1  antibodies  (50  ^g  of  IgG  at 
10  »xCi  M-g _1.  red  line;  o  =  10)  versus  control  non-targeting  125l-lgG  (blue  line;  n  =  10) 


versus  untreated  animals  (green  line;  n  =  3).  b,  Body  weights  of  rats  were  measured  daily 
and  plotted.  Red,  tumour-bearing  rats  treated  with  125l-AnnA1  antibodies;  blue,  tumour¬ 
bearing  rats  treated  with  control  125l-lgG;  green,  untreated  tumour-bearing  animals; 
black,  untreated  normal  rats,  n  =  3.  Maximum  standard  deviation  <  1 0  g  for  each  weight 
plotted. 
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much  more  evident  in  tumour  endothelial  plasma  membranes  than 
in  normal  plasma  membranes. 

Expression  profiling  using  endothelial  plasma  membranes  iso¬ 
lated  from  major  organs  demonstrated  that  almost  all  of  these 
proteins  exist  at  the  endothelial  cell  surface  of  at  least  one  major 
organ,  albeit  mostly  at  levels  much  less  than  tumours  (Fig.  3e). 
One  promising  tumour  candidate  target  was  the  34  KDa  protein 
recognized  by  annexin  A1  (AnnAl)  antibodies  only  in  tumour 
endothelial  plasma  membrane.  Tissue  immuno-histochemistry 
confirmed  tumour  blood  vessel  reactivity  (data  not  shown).  Thus, 
the  tumour  microenvironment  appeared  to  induce  distinct  protein 
expression  on  the  endothelial  cell  surface. 

Targeting  and  imaging  of  solid  tumours 

Annexins  are  cytosolic  proteins  that  can  associate  with  cell  mem¬ 
branes  in  a  calcium-dependent  manner41.  Some  annexins  may 
translocate  the  lipid  bilayer  to  the  external  cell  surface41.  To  test 
whether  AnnAl  is  sufficiently  exposed  and  tumour  vessel  specific 
to  permit  immuno-targeting  in  vivo ,  we  performed  whole-body 
imaging  using  1 23I-labelled  AnnAl  monoclonal  antibodies,  ^-scinti¬ 
graphic  planar  images  captured  four  hours  postinjection  showed 
a  distinct  focus  of  radioactivity  in  the  lung  and  little  signal  else¬ 
where,  in  the  bodies  of  rats  with  tumours  (Fig.  4a,  b).  Non-targeting 
l25I-labelled  IgG  did  not  show  detectable  selective  tumour  uptake 
(data  not  shown).  When  the  lungs  were  imaged  ex  vivo,  we  observed 
l25I- AnnAl  antibody  accumulation  in  the  tumour  as  hot  spots 
corresponding  to  visible  tumours  (Fig.  4c,  d).  Furthermore,  target¬ 
ing  was  prevented  by  a  30-fold  excess  of  unlabelled  AnnAl  IgG  but 
not  by  control  IgG  (data  not  shown).  Region-of-interest  and 
biodistribution  analysis  confirmed  targeting  in  vivo  with  an  average 
tumour  accumulation  at  two  hours  of  34%  injected  dose  per  gram 


of  tissue.  This  compared  favourably  to  VEGF  receptor  antibodies 
which  accumulated  at  6.4%  injected  dose  per  gram  of  tumour. 
When  injected  into  rats  without  tumours,  125I-AnnAl  antibodies 
showed  no  targeting  of  normal  organs,  including  lung  at  levels  less 
than  1%  injected  dose  per  gram  of  tissue,  whereas  VEGF  receptor 
antibody  accumulation  was  greater  in  multiple  organs  (data  not 
shown).  The  uptake  ratio  in  rat  tumour-bearing  lungs  versus 
normal  lungs  was  up  to  2.0  and  70  for  antibodies  to  VEGF  receptor 
and  AnnAl,  respectively.  Thus,  7 -scintigraphic  imaging  rapidly 
validated  AnnAl  as  a  tumour  target  that  is  readily  accessible  to 
antibody  injected  intravenously  for  tumour  targeting  and  imaging 
in  vivo.  AnnAl  appeared  to  be  selectively  externalized  on  the 
endothelial  cell  surface  in  the  solid  tumours. 

Radio-immunotherapy  of  solid  tumours 

Many  tumour-bearing  rats  imaged  with  the  ,25I- AnnAl  antibody 
survived,  so  we  performed  a  survival  study  and  recorded  animal 
body  weights;  80%  of  the  animals  treated  with  the  l25I- AnnAl 
antibody  survived  8  days  or  longer  compared  with  the  ,25I-IgG- 
treated  and  untreated  rats,  which  died  within  7  days  (Fig.  5a).  The 
body  weights  of  all  tumour-bearing  rats  began  to  drop  7-10  days 
after  tumour  cell  inoculation  (Fig.  5b).  The  control  rats  continued 
to  decrease  in  body  weight  to  23-30%  less  than  normal  at  their 
death.  In  contrast,  rats  treated  with  l25I- AnnAl  IgG  began  to  gain 
weight  within  3^1  days  and  reached  a  normal  body  weight  after  25 
days.  This  increased  survival  was  striking  because  in  this  model 
many  animals  die  within  2-4  days  of  treatment  and  thus  may  lack 
sufficient  time  to  benefit  from  treatment.  The  survival  rate  of  rats 
surviving  the  first  week  approached  90%.  The  one  rat  that  died  after 
2  weeks  required  euthanasia  because  of  a  leg  tumour  and  large  tail 
tumour  that  were  not  apparent  when  treated.  Thus,  a  single 


Human  tumours  Adjacent  normal  tissue 


Figure  6  Expression  of  AnnAl  on  vascular  endothelium  of  multiple  primary  human  solid 
tumours,  a.  Immuno-staining  of  paraffin -embedded  (prostate)  or  frozen  (lung,  liver) 
sections  from  human  tumour  specimens  and  normal  tissues,  as  indicated,  with  antibodies 
to  AnnAl  (top  panels)  and  PECAM  (bottom  panels).  Arrowheads  indicate  blood  vessels. 


Scale  bar  represents  10^m.  b,  Low  magnification  images  of  human  lung  and  breast 
tumour  specimens  stained  with  antibodies  to  AnnAl  or  PECAM,  as  indicated.  Scale  bar 
represents  100  pm. 
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injection  of  l25I- AnnAI  antibody  caused  significant  remission  even 
in  advanced  disease. 

AnnAI  in  human  tumour  neovasculature 

We  immuno-stained  tissue  sections  of  human  solid  tumours. 
AnnAI  antibody  labelled  blood  vessels  of  human  prostate,  liver, 
breast  and  lung  tumours  but  not  matched  normal  tissues  (Fig.  6). 
Antibodies  to  platelet  endothelial  cell  adhesion  molecule- 1 
(PECAM)  stained  both  normal  and  tumour  blood  vessels.  The 
lack  of  AnnAI  expression  in  vascular  endothelium  of  multiple 
normal  organs  has  been  reported  previously42-45.  Thus,  AnnAI 
was  selectively  detected  on  the  neovascular  endothelium  of  multiple 
human  solid  tumours. 

Discussion 

We  describe  a  new  logic-based  discovery  and  validation  approach 
that  integrates  newly  developed,  global  analytical  techniques  to  map 
the  proteins  expressed  in  vivo  at  the  luminal  endothelial  cell  surface. 
This  approach  has  demonstrated  distinct  molecular  signatures  in 
normal  and  neoplastic  tissues.  We  applied  this  strategy  to  rat  lungs 
and  solid  tumours  to  uncover,  from  the  vast  number  of  proteins 
expressed  in  tissue,  approximately  50  differentially  expressed  pro¬ 
teins,  including  two  promising  tissue-selective  endothelial  cell  sur¬ 
face  proteins  that  permitted  rapid  and  specific  immuno-targeting 
and  imaging  in  vivo.  Profiling  the  endothelial  cell  surface  that  is 
accessible  in  vivo  is  an  important  logical  step  for  validating  the 
hypothesis  of  tissue-modulated  endothelial  cell  diversity  in  vivo  and 
for  discovering  potential  targets.  This  is  important  not  only  to  direct 
pharmacodelivery  and  molecular  imaging  but  also— by  means  of 
caveolae— to  overcome  the  restrictive  endothelial  cell  barrier  to 
transport  drugs  or  genes  to  the  underlying  tissue  cells16,17. 

Key  features  of  our  analysis  included  tissue  subfractionation  with 
subtractive  proteomic  and  bioinformatic  filters  that  together 
reduced  tissue  complexity  by  greater  than  five  orders  of  magnitude, 
and  unmasked  a  manageable  subset  of  proteins  at  the  inherently 
accessible  blood-tissue  interface.  This  has  led  to  the  creation  of  a 
database  of  vascular  endothelial  cell  surface  proteins  containing 
accessible  proteins  apparently  modulated  by  the  tissue.  After  bio¬ 
informatic  filtering,  western  blot  analysis  and/or  tissue  immuno- 
staining  provided  sensitive  expression  profiling  to  yield  only  a  few 
candidate  proteins  exhibiting  organ-  or  tumour-specificity.  Planar 
and  SPECT  imaging,  using  intravenously  injected  antibodies 
specific  to  each  protein,  visualized  targeting  and  tissue  accumu¬ 
lation  with  high  sensitivity  and  resolution.  Imaging  thus  provided  a 
rigorous  and  objective  validation  of  accessibility  and  tissue- 
specificity  of  the  few  remaining  candidate  targets  and  ultimately 
demonstrated  potential  utility  such  as,  how  rapidly  and  to  what 
extent  the  antibody  targets  single  organs  or  solid  tumours  in  vivo. 

So  far,  in  vivo  imaging  correlates  well  with  the  expression 
profiling  using  isolated  endothelial  plasma  membranes.  It  showed 
lung-specific  immuno-targeting  for  APP  (Fig.  2),  tumour- 
specificity  for  AnnAI  (Fig.  4)  and  targeting  of  multiple  tissues  for 
less  specific  target  proteins  (that  is,  PV-1,  VEGF-R  and  podoca- 
lyxin)  (our  unpublished  observations).  Because  of  poor  access 
inside  many  tissues,  antibodies  injected  intravenously  usually 
require  significantly  higher  doses  (^250  pg  kg  1  (ref.  46)  compared 
with  20  pgkg  1  used  here)  to  get  a  small  percentage  into  the  tissue 
that  binds  and  can  then  be  visualized  clearly  a  day  or  so  later,  after 
the  background  signal  has  been  cleared  from  the  blood.  Here,  the 
binding  was  direct  and  unhindered  by  barriers  so  that  both  tissue 
accumulation  and  blood  clearance  were  rapid,  thereby  providing 
striking  images  within  minutes  to  hours.  The  rapidity  and  high 
levels  of  specific  targeting  observed  here  meet  the  theoretical 
expectation  of  the  vascular  targeting  strategy17.  Although  fast 
emerging  as  a  standard  for  assessing  targeting  and  specificity1-4, 
whole-body  imaging  may  be  underutilized  in  target  validation 
despite  being  non- invasive  and  highly  sensitive. 
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Using  a  new,  integrated  analytical  approach  we  have  found  new 
tissue-specific  endothelial  cell  surface  proteins  that  can  act  as 
vascular  targets  to  bring  us  one  step  closer  to  achieving  the 
century-old  elusive  goal  of  targeting  single  organs  or  solid  tumours 
in  vivo1,2,4,6,10.  The  development  and  optimization  of  several  key 
technologies  has  provided  the  sensitivity  and  throughput  required 
for  this  new  target  discovery  and  validation  process.  We  expect  that 
site-directed  vascular  and  caveolar  targeting  will  benefit  both  drug 
and  gene  delivery  in  the  treatment  of  many  diseases16.  We  have 
demonstrated  here,  using  a  rat  tumour  model  that  monoclonal 
antibodies  to  AnnAI  can  effectively  direct  low  levels  of  radio¬ 
nuclides  (lOOpCi)  to  concentrate  in,  and  thus  destroy  solid 
tumours,  which  ultimately  increases  animal  survival.  Because 
AnnA  1  is  also  selectively  detected  in  multiple  human  solid  tumours, 
this  target  may  similarly  help  to  image  and  treat  human  disease.  □ 

Methods 

A  detailed  description  of  the  materials,  2D  gel  analysis,  expression  protiling  in  vivo  of 
candidate  proteins,  rat  tumour  models,  -y -scintigraphic  imaging  and  biodistribution 
analysis  is  available  in  Supplementary  Methods. 

Proteomic  analysis 

We  used  a  three-pronged  approach  to  resolve  proteins/ peptides  for  mass  spectrometric 
analysis:  ( 1 )  The  proteins  of  silica-coated  luminal  endothelial  cell  plasma  membranes  and 
caveolae  isolated  from  various  normal  organs  or  tumours  were  resolved  by  high  resolution 
2D  gel  analysis  before  excising  potential  organ-  or  tumour-specitic  protein  spots  for  mass 
spectrometric  analysis;  (2)  Proteins  in  the  isolated  endothelial  plasma  membranes  and 
caveolae  were  separated  using  one-dimensional  (ID)  SDS-polyacrylamide  gel 
electrophoresis  (SDS-PAGE)  and  specific  bands  of  interest  were  cut  out  or  the  whole  gel 
lane  was  cut  into  50  slices,  and  the  proteins  in  each  of  the  gel  slices  were  analysed  by  mass 
spectrometry;  and  (3)  Multi-dimensional  Protein  Identification  Technology  (MudPIT) 
was  used  to  analyse  tryptic  peptides  from  a  complex  mixture  of  proteins  extracted  directly 
from  the  whole  isolated  endothelial  plasma  membrane  fraction  and  caveolae  isolates.  Each 
gel  spot/slice  was  de-stained  and  digested  overnight  with  trypsin  before  extracting  the 
cleaved  peptides  from  the  gel  and  then  loading  them  onto  a  reverse  phase  micro-column 
for  gradient  acetonitrile  elution  directly  into  the  mass  spectrometer  (LCQ  Deca  XP  ion 
trap  mass  spectrometer  (ThermoFinnigan)  equipped  with  a  modified  micro-electrospray 
ionization  source  from  Mass  Evolution).  For  MudPIT,  1 50  ng  of  complex  peptide  mixture 
was  separated  by  2D  liquid  chromatography,  comprising  a  micro-column  packed  with 
three  phases  of  chromatographic  material  as  follows:  8.5  cm  of  5fim  reversed  phase 
material  (Polaris  C18-A,  Metachem);  4  cm  of  5 urn  300  A  strong  cation  exchanger 
(PolyLC);  and  lastly  3.5cm  of  C|g  material,  using  a  helium  pressure  cell  operated  at 
600-900  psi  (Mass  Evolution).  Peptides  were  directly  eluted  into  the  mass  spectrometer 
using  2D  chromatography  with  18  step-elutions  from  the  strong  cation  exchanger, 
followed  by  a  gradient  elution  of  the  reversed  phase  material.  Operation  of  the  quarternary 
Agilent  1 100  HPLC  pump  and  the  mass  spectrometer  was  fully  automated  during  the 
entire  procedure  using  the  Excalibur  1.2  data  system  (ThermoFinnigan).  Continuous 
cycles  of  one  full  scan  ( m/z  400  to  1,400)  followed  by  three  data-dependent  MS/MS 
measurements  at  35%  normalized  collision  energy  were  performed.  MS/MS 
measurements  were  allowed  for  the  three  most  intense  precursor  ions  with  an  enabled 
exclusion  list  of  25  m/z  values  (±1.5  Da)  or  a  maximum  time  limit  of  5  min.  The  zoom 
scan  function  to  determine  the  charge  state  was  disabled  in  order  to  increase  the  duty  cycle 
of  the  instrument. 

Database  search  and  in  silico  analysis  of  tandem  mass  spectra 

MS/MS  spectra  were  extracted  from  raw  files  requiring  a  minimum  of  21  signals  with  an 
intensity  of  at  least  4.75  X  10"*  arbitrary  units  (AU).  Extracted  MS/MS  spectra  were 
automatically  assigned  to  the  best  matching  peptide  sequence  using  the  SEQUEST 
algorithm  and  the  Sequest  Browser  software  package  (ThermoFinnigan).  SEQUEST 
searches  were  performed  using  a  rat  protein  database  containing  40,800  protein  sequences 
that  were  downloaded  as  FASTA  formatted  sequences  from  ENTREZ  (National  Center  for 
Biotechnology  Information  (NCBI);  http://www.ncbi.nlm.nih.gov/Entrez).  Sequence 
redundancies  were  removed  using  Perl  script.  The  peptide  mass  search  tolerance  was  set  to 
3  Da.  Spectral  matches  were  retained  with  a  minimal  cross-correlation  score  (XCorr)  of 
1.5, 2.2  and  3.3  for  charge  states  +1,  -1-2  and  +3,  respectively.  DeltaCN  (top  match’s  XCorr 
minus  the  second-best  match’s  XCorr,  divided  by  top  match’s  XCorr)  had  to  be  equal  or 
less  than  0.07.  Retained  spectral  matches  were  filtered  and  re-assigned  to  proteins  using 
DTASelect.  DTASelect  outputs  of  independent  measurements  were  entered  into  the 
AVATAR  database.  AVATAR  was  designed  to  store  a  large  amount  of  mass  spectrometric 
data  and  to  provide  tools  to  analyse  the  data  to  extract  valuable  information.  We  used 
relational  models  for  database  design  based  on  entity  relationship  and  implemented  the 
database  in  the  MySQL  relational  database  management  system  (MySQL)  to  support 
database  query  and  management.  This  relational  database  plus  the  Perl-based  interface 
greatly  improved  data  organization,  data  consistency  and  integrity,  and  facilitated  data 
comparison  and  information  retrieval.  In  the  case  of  the  ID  gel  and  MudPIT  approaches, 
AVATAR  was  used  to  subtract  the  data  to  find  proteins  detected  only  at  the  lung 
endothelial  cell  surface  in  vivo  versus  in  vitro  or  on  the  tumour  but  not  normal 
endothelium. 
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In  silico  bioinformatic  interrogation 

To  identify  possible  candidates  for  intravenously  accessible  targets  from  the  subset  of 
proteins  identified  as  lung  or  tumour  induced,  we  determined  their  currently  known 
membrane-associated  structure  (bilayer  spanning  versus  lipid  anchor  (intra-  or 
extracellular)  versus  peripheral  interaction)  from  scientific  reports  and/or  protein 
databases,  such  as  SwissProt  (http://us.expasy.org/sprot/sprot-top.html)  and  the  NCBI 
ENTREZ  protein  database  (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi).  We  also  used 
web-based  prediction  programs  to  identify  candidates  that  may  harbour  transmembrane 
spanning  alpha  helices  (Prediction  of  Transmembrane  Regions  and  Orientation 
(TMpred),  http://www.ch.embnet.org/software/TMPRED_form.html)  or  glycosylation 
sites  to  indicate  a  possible  ectodomain  exposed  to  the  circulating  blood  (Prosite  Scan, 
http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pfrpage  =  npsa_prosite.html).  Only  100% 
probabilities  were  taken  into  consideration. 
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Supplementary  methods. 

Materials.  Antibodies  were  obtained:  AnnAI  (Cl 9,  polyclonal),  AnnA8,  EphA5,  Eph  A7, 
ACE,  APN,  caveolin-l,  E-cadherin,  Tie  2,  and  VE-Cadherin  from  Santa  Cruz 
Biotechnology  (Santa  Cruz,  CA);  human  AnnAI  (1 1-29,  polyclonal)  from  ICN  Biomedicals 
(Irvine,  CA);  aquaporin-1,  was  obtained  form  BD  Biosciences/Pharmingen  (San  Diego, 
CA);  P-COP,  P-actin.  Md>,  and  fibroblast  surface  protein  from  Sigma  (Saint  Louis,  MO); 
RAGE  from  Affinity  Bioreagents  (Golden,  CO);  TfnR,  VEGF  R2,  and  ECE  were  from 
Zymed  Lab,  Inc.  (San  Francisco,  CA);  nucleolin  from  Leinco  Technologies  (St.  Louis, 
MO);  CD4  from  Serotech  (Raleigh,  NC);  DPPIV  form  BD  Biosciences  (San  Diego,  CA); 
TM  from  Covance  (Princeton,  NJ);  MPO  from  Accurate  Chemicals  (Westbury.  NY),  and 
Vitamin  D  binding  protein  from  DAKO  (Carpinteria,  CA).  Antibodies  against  carbonic 
anhydrase  IV  were  a  kind  gift  of  W.S.  Sly,  St.  Louis  University  (St.  Louis,  MO);  APP,  PV- 
I,  and  podocalyxin  were  produced  in  house;  seven  transmembrane  receptor  was  a  kind  gift 
of  Dr.  Shigehisa  Hirose,  Tokyo  Institute  of  Technology  (Yokohama.  Japan);  OX-45  was  a 
kind  gift  of  Dr.  Neil  Barclay,  University  of  Oxford  (Oxford,  UK);  galectin  1  was  a  kind  gift 
of  Dr.  M.  Huflejt,  Sidney  Kimmel  Cancer  Center  (San  Diego,  CA);  VEGF  receptor  1  was  a 
kind  gift  of  Dr.  D.  Sanger,  Beth  Israel  Deaconess  Medical  Center  (Boston  MA);  endoglin 


was  a  kind  gift  of  Dr.  Yamashita,  Ludwig  Institute  for  Cancer  Research;  neurophilin  1  was 
a  kind  gift  of  Dr.  Ginty,  Johns  Hopkins  School  of  Medicine  (Baltimore.  MD);  C-CAM I 
was  a  kind  gift  of  Sue-Hwa  Lin.  The  University  of  Texas  M.D.  Anderson  Cancer  Center 
(Houston,  TX).  After  testing  commercially  available  AnnAl  monoclonal  antibodies,  we 
found  it  necessary  to  generate  our  own  panel  of  monoclonal  antibodies  using  recombinant 
AnnAl. 

Expression  profiling  in  vivo  of  candidate  proteins.  To  assess  expression  of  candidate 
endothelial  cell  proteins  in  different  tissues,  proteins  from  the  tissue  homogenate  and  P 
isolated  from  normal  rat  lung,  heart,  kidney,  liver,  and/or  tumour-bearing  lungs  were 
solubilized  with  CLB  buffer  (6M  urea,  0.5M  Tris  pH  6.8.  9mM  EDTA,  9%  sodium  dodecyl 
sulfate),  separated  by  SDS-PAGE,  and  electrotransferred  to  nitrocellulose  filters  for 
immunoblotting  with  the  appropriate  antibodies  as  described1. 

Immunohistochemical  staining  of  tissues.  Tissue  samples  were  provided  by  the 
Cooperative  Human  Tissue  Network  which  is  funded  by  the  National  Cancer  Institute. 
Frozen  tissues  were  cut  (5  urn)  on  a  Microm  HM505E  cryomicrotome.  Sections  were  fixed 
with  neutral  buffered  formalin  for  5  min  at  room  temperature  then  incubated  for  one  hour  at 
room  temperature  in  blocking  solution  (5%  FBS,  0.1%  Tween  20  in  PBS).  After  a  2-hour 
incubation  at  room  temperature  in  primary  antibodies  (diluted  in  blocking  solution)  the 
sections  were  washed  then  treated  with  the  appropriate  biotin-conjugated  secondary 
antibody  (KPL  Laboratories,  Gaithersburg,  MD)  for  1  hour  at  room  temperature,  washed 
again  then  treated  with  a  streptav  id  in-conjugated  horseradish  peroxidase  (KPL 
Laboratories)  for  I  hour  at  room  temperature.  Immune  complexes  were  detected  using  a 
Liquid  DAB  staining  kit  from  BioGenex  (San  Ramon,  CA).  Formalin-fixed,  paraffin- 
embedded  sections  (5  pm)  were  cut  on  a  Microm  HM340E  microtome.  Antigen  retrieval 
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was  performed  using  acid  citrate  buffer  following  standard  procedures.  After  the  sections 
were  washed  in  water  to  remove  the  citrate  buffer,  they  were  blocked  and  immunostained 
as  above. 

Rat  tumor  models.  Female  Fisher  rats  (100-150  gms)  were  injected  via  the  tail  vein  with  a 
cell  suspension  of  13762  breast  adenocarcinoma  cells  to  give  ample,  well  circumscribed, 
and  highly  vascularized  tumours  in  the  lung.  To  create  a  maximum  density  of  tumour 
lesions  of  3-8  mm  in  diameter  that  are  clearly  visible  in  the  lungs  (as  shown  in  Fig.  3A),  we 
injected  5  x  105  1  3762  cells  14-15  days  prior  to  perfusion  and  isolation  of  tumour-bearing 
lung  P.  To  obtain  a  few  well-circumscribed  tumours  of  3-6  mm  in  diameter,  we  injected  I  x 
105  cells  21  days  prior  to  performing  the  imaging  experiments. 

Gamma  scintigraphic  imaging  and  biodistribution  analysis.  Monoclonal  antibodies 
were  isolated  using  GammaBind  Plus  Sepharose  (Amersham,  Piscataway,  NJ)  and 
conjugated  to  l23I  using  lodogen  as  described2.  Biodistribution  analysis  was  performed  as 
described2.  Imaging  was  performed  using  an  A-SPECT  imaging  system,  a  dedicated  small 
animal  radiotracer  imaging  system  (Gamma  Medica.  Inc.,  Northridge.  CA)\  fitted  with  a 
parallel-hole  collimator.  Normal  and  tumour-bearing  female  Fisher  rats  were  anaesthetized 
and  injected  via  the  tail  vein  with  l-labeled  monoclonal  antibody  (5  pg  IgG;  10  pCi/pg) 
before  being  subjected  to  planar  gamma  scintigraphic  imaging  captured  over  10  min.  For 
tomographic  studies,  the  images  were  captured  in  6°  increments  for  64  frames  (30 
sec/frame).  After  whole  body  imaging,  in  some  cases,  the  lungs  were  excised  for  planar 
imaging  captured  over  10  min  ex  vivo. 
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